BULLETIN  OF  THE  EXTENSION 
DIVISION,  INDIANA  UNIVERSITY 


Vol.  VIII 


BLOOMINGTON,  IND. 


No.  11 




Tenth 

Conference  on  Educational 

Measurements 


JULY,  1923 


Entered  as  second-class  mail  matter,  October  15,  1915,  at  the  post-office  at  Bloom- 
ington,  Indiana,  under  the  Act  of  August  24,  1912.  Published  monthly  by  Indiana 
University,   from  the  University  Office,   Bloomington,   Indiana. 


For  Sale  by  the  University  Bookstore,  Bloomington,  Ind. 

Price  50  cents 

A  limited  number  of  copies  of  this  bulletin  will  be  distributed  free 
of  charge  to  citizens  of  Indiana. 


TENTH  ANNUAL  CONFERENCE 


ON 


EDUCATIONAL 
MEASUREMENTS 


Held  at  Indiana  University,  Bloomington,  Ind.,  Friday 
and  Saturday,  April  20  and  21,  1923 


Published  By 
The Extension Division of Indiana University

1923 


Contents 


The Work of the Bureau of Cooperative Research, Indiana University.
By Henry Lester Smith, Dean of the School of Education,
Indiana University

The Effect of Certain Kinds of Drill Exercises on Comprehension.
By Grover H. Alderman, Associate Professor of Elementary Education,
Indiana University

How to Interpret and Make Use of Mental Tests. By Herman H.
Young, Associate Professor of Psychology, Indiana University

Improving Instruction thru Educational Measurement. By Frank
W. Ballou, Superintendent of Schools, Washington, D.C.

Some Dangers of Scientific Measurement to be Avoided (Abstract).
By Frank W. Ballou

Education and Our Responsibility for its Improvement. By Frank
W. Ballou

Selling Educational Research to a Large City School System. By
Robert H. Lane, Assistant Superintendent of Schools, Los Angeles,
California

The Use of Educational Research and Experimentation in School
Administration. By Ellis U. Graff, Superintendent of Indianapolis
Schools

The Use of Educational Research and Experimentation in School
Administration. By A. C. Senour, Director of Department of
Measurements, East Chicago (Ind.) Schools


The  Work  of  the  Bureau  of  Cooperative 
Research,    Indiana  University 


Henry Lester Smith, Dean of the School of Education, Indiana
University

The  work  of  the  Bureau  of  Cooperative  Research,  Indiana  Univer- 
sity, for  the  current  year  is  organized  along  nine  lines  of  activity. 

In  describing  these  various  lines  I  have  felt  free  in  three  or  four 
instances  to  use  the  wording  of  the  person  directly  responsible  for  the 
study.  Particularly  is  this  true  in  the  studies  in  Latin,  methods  of 
teaching  reading,  tests  in  home  economics,  and  nursing. 

The  first  of  these  lines  is  that  of  cooperation  with  educational  foun- 
dations in  carrying  on  research  work.  At  present  two  studies  are  being 
conducted  in  an  effort  to  further  the  program  of  investigation  recently 
inaugurated by the American Classical League. These two studies have
to  do  with  the  percentage  of  the  most  commonly  used  words  in  the  Eng- 
lish language  that  are  derived  from  Latin.  One  person  is  working  on 
the  first  5,000  words  of  the  Thorndike  list  of  the  most  commonly  used 
words  and  another  is  working  on  the  second  5,000  words  of  the  same  list. 

The  only  scientific  way  of  determining  such  a  Latin  vocabulary  is, 
first, by a word count of the English words used most often in the most
widely  read  reading  matter,  and,  secondly,  by  an  etymological  analysis 
of  the  English  words  in  the  list  so  obtained. 

Dr.  Thorndike  is  now  determining  the  25,000  English  words  most 
frequently  used.  With  his  cooperation  the  committee  in  charge  of  the 
classical  investigation  will  have  at  their  disposal  his  list  for  etymological 
analysis as a basis for determining which Latin words are most important
in  interpreting  the  largest  number  of  English  words. 

Dr.  Thorndike's  English  word  count  had  already  progressed  far 
enough a year ago last fall that he was enabled to publish a list of the
10,000  English  words  most  often  used  in  printed  and  written  discourse. 
The  list  of  10,000  words  already  published  is  to  be  found  in  the  Teacher's 
Word  Book  published  by  Teachers  College,  Columbia  University,  1921. 

The  scope  of  the  word  count  used  in  determining  this  list  of  10,000 
words  may  be  most  briefly  stated  by  quoting  the  first  paragraph  from 
the instructions for using the Teacher's Word Book. I quote as follows:
"The Teacher's Word Book is an alphabetical list of the 10,000 words
which are found to occur most widely in a count of about 625,000 words
from literature for children; about 3,000,000 words from the Bible and
English classics; about 300,000 words from elementary school textbooks;
about 50,000 words from books about farming, cooking, sewing,
the trades, and the like; about 90,000 words from the daily newspapers;
and about 500,000 words from correspondence. Forty-one different
sources were used." This totals about 4,565,000 words in the count upon
which the list of 10,000 words is based.
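
The total quoted above can be checked by adding the six source figures. The sketch below is only an arithmetic check: it takes the approximate figures as 625,000; 3,000,000; 300,000; 50,000; 90,000; and 500,000, and the dictionary keys are shorthand labels chosen here, not Thorndike's own headings.

```python
# Approximate word counts per source category, as quoted from the
# Teacher's Word Book (every figure is an "about" figure in the quotation).
source_counts = {
    "literature for children": 625_000,
    "Bible and English classics": 3_000_000,
    "elementary school textbooks": 300_000,
    "farming, cooking, sewing, trades": 50_000,
    "daily newspapers": 90_000,
    "correspondence": 500_000,
}

# Sum the six categories to get the size of the whole count.
total_words = sum(source_counts.values())
print(f"{total_words:,}")  # 4,565,000
```

The sum agrees with the figure of about 4,565,000 words on which the list of 10,000 is based.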



Each of the words in the list of 10,000 has been given an index number
weighted to show how frequently it occurs and how widely it is distributed
in the forty-one sources upon which the count is based, when
compared  with  the  frequency  of  occurrence  and  latitude  of  distribution 
of  the  other  words  in  the  list. 

With  this  list  of  10,000  words  as  a  basis,  it  is  possible  to  form  a 
list  of  Latin  words  that  are  most  important  in  interpreting  the  largest 
number  of  English  words  derived  from  Latin  that  are  in  common  use. 
To  form  this  Latin  list  it  is  necessary  to  find  the  Latin  source  word  for 
each  of  the  English  words  derived  from  the  Latin.  To  determine  the 
place  of  each  of  the  Latin  words  in  the  list  in  order  of  relative  impor- 
tance, it  is  necessary  to  compare  the  number  of  English  derivatives  of 
each  of  the  Latin  words  and  also  the  sums  of  the  index  numbers  of  the 
English  derivatives  of  each  of  the  Latin  words.  Thus  the  Latin  word 
pars  whose  English  derivative  part  has  an  index  number  of  145  would 
rank in the Latin list some distance ahead of persona whose derivative
person  has  an  index  number  of  76. 
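
The ranking just described can be sketched in a few lines. This is a minimal illustration, not the committee's actual procedure: only the two pairs named in the text (part, 145, and person, 76) are real data, the function name `rank_latin_sources` is made up for the example, and using the number of derivatives as a tie-breaker after the summed index numbers is an assumption, since the text says only that both quantities are compared.

```python
from collections import defaultdict

# (English derivative, Latin source word, Thorndike index number).
# Only these two rows come from the text; a full table would hold
# every Latin-derived word in the list of 10,000.
derivatives = [
    ("part", "pars", 145),
    ("person", "persona", 76),
]

def rank_latin_sources(rows):
    """Order Latin source words by the summed index numbers of their
    English derivatives (largest first). Ties are broken by the number
    of derivatives, then alphabetically; the tie-break is an assumption."""
    totals = defaultdict(int)
    counts = defaultdict(int)
    for _english, latin, index_number in rows:
        totals[latin] += index_number
        counts[latin] += 1
    return sorted(totals, key=lambda w: (-totals[w], -counts[w], w))

ranking = rank_latin_sources(derivatives)
print(ranking)  # ['pars', 'persona']
```

With more derivatives per source word, each additional English word simply adds its index number to the total for its Latin source.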

Latin  teachers  will  naturally  be  interested  to  know  how  many  of 
the Latin words in this Latin list are also to be found in Lodge's list of
2,000  words  most  important  to  the  traditional  course  in  secondary  school 
Latin.  It  is  proposed  to  indicate  the  Latin  words  that  are  common  to 
both  lists  in  the  bulletin  which  is  to  embody  the  results  of  this  study. 

In  regard  to  the  1,000  English  words  ranking  highest  by  the  index 
numbers  in  Thorndike's  English  word  list,  it  is  now  possible  to  give  some 
interesting  figures.  Of  the  first  500  words  ranking  highest  by  index 
numbers,  82,  or  17  per  cent,  are  Latin  derivatives.  Of  the  82  Latin 
source words, 49, or approximately 60 per cent, are also included in
Lodge's  list  of  2,000  Latin  words.  Of  the  second  500  words,  204,  or  41 
per  cent,  are  derived  from  Latin.  Of  the  Latin  source  words  of  these 
204  derivatives,  103,  or  approximately  51  per  cent,  are  also  included  in 
Lodge's  list.  Summing  up  for  the  first  thousand,  286  words  are  derived 
from the Latin, or approximately 29 per cent. Of these 286 Latin source
words,  152,  or  53  per  cent,  are  also  included  in  Lodge's  list. 
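
The figures in this paragraph can be recomputed from the counts alone. The sketch below is an arithmetic check under the assumption that each percentage is a simple ratio of the two counts named beside it; the helper `per_cent` and the labels are chosen here for illustration.

```python
def per_cent(part, whole):
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Counts quoted in the text for the 1,000 highest-ranking words.
latin_first, lodge_first = 82, 49       # first 500 words
latin_second, lodge_second = 204, 103   # second 500 words

# Totals for the first thousand words.
latin_total = latin_first + latin_second   # 286
lodge_total = lodge_first + lodge_second   # 152

shares = {
    "Latin derivatives, first 500": per_cent(latin_first, 500),
    "also in Lodge's list, first 500": per_cent(lodge_first, latin_first),
    "Latin derivatives, second 500": per_cent(latin_second, 500),
    "also in Lodge's list, second 500": per_cent(lodge_second, latin_second),
    "Latin derivatives, first 1,000": per_cent(latin_total, 1000),
    "also in Lodge's list, first 1,000": per_cent(lodge_total, latin_total),
}

for label, value in shares.items():
    print(f"{label}: {value} per cent")
```

The totals come to 286 and 152, as stated. The first share works out to 16.4 per cent, slightly under the 17 per cent given in the text; the remaining shares agree with the text after rounding.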

As  I  stated  above,  two  persons  are  at  present  engaged  in  making  a 
study  of  the  Thorndike  word  list  along  the  lines  previously  mentioned. 
The  School  of  Education  and  the  Department  of  Latin  here  at  Indiana 
University  plan  to  issue  a  bulletin  in  cooperation  containing  the  results 
of  the  study.  As  at  present  planned,  the  bulletin  is  to  consist  of  two 
parts.  Part  I  will  contain  three  lists.  List  1  in  Part  I  will  consist  of  the 
list  of  10,000  English  words  alphabetically  arranged  just  as  they  are  in 
Thorndike's  Teacher's  Word  Book  but,  in  addition  to  a  second  column 
after  the  column  of  words  on  each  page  in  which  index  numbers  of  the 
words  are  given,  there  will  be  two  columns  more.  In  the  third  column 
etymological  history  of  the  word  will  be  indicated.  In  the  fourth  column  the 
Latin or Greek source word will be given when any occurs. List 2 in Part
I will contain in the first column the list of Latin source words alphabetically
arranged; in Column 2 the sum of the frequency index numbers
of  the  English  derivatives  of  each  Latin  source  word;  in  Column  3,  a 
number indicating the rank of the Latin source words as compared with
the other Latin source words on the basis of the sums of the frequencies
of  index  numbers  of  the  English  derivatives  of  each;  in  Column  4,  a 
mark  indicating  whether,  if  it  happen  to  be  common  to  Lodge's  List  of 
2,000,  it  is  important  in  the  study  of  Caesar,  or  of  Cicero  or  of  Vergil; 
in  Column  5,  a  list  of  the  English  derivatives  of  each  of  the  Latin  source 
words. 

List  3  in  Part  I  will  contain  in  Column  1  the  list  of  Greek  source 
words  alphabetically  arranged;  in  Column  2,  the  sum  of  the  frequency 
index numbers of the English derivatives of each; in Column 3, a number
indicating the rank of each of the Greek words in the list; in Column
4, the Latin derivatives of each of the Greek words which are to
be  found  in  the  Latin  list  previously  given,  when  there  are  any  such; 
and  in  Column  5,  a  list  of  the  English  derivatives  of  each  Greek  word. 

Part  II  of  the  bulletin  as  at  present  planned  will  consist  of  three 
subdivisions.  Subdivision  1  is  to  contain  a  statement  of  the  per  cent 
value  of  Greek,  of  Latin,  and  of  each  other  important  source  language 
in  giving  increased  understanding  of  English  as  disclosed  by  comparing 
(a)  the  sums  of  the  frequencies  or  index  numbers  of  the  English  words 
derived from each of the languages considered; and (b) the actual numbers
of source words of English derivatives in each language.

Subdivision  2  in  Part  II  is  to  contain  a  statement  of  the  value  of 
the  mastery  of  Lodge's  list  of  2,000  words  in  giving  an  understanding  of 
the  Latin  element  in  the  English  language.  Subdivision  3  in  Part  II  is 
to  contain  graphs  showing  the  relative  standing  of  the  English,  French, 
Latin,  and  Greek  elements  in  Thorndike's  list,  taking  the  words  in  order 
of  frequency  index  numbers  by  units  of  100  in  case  of  the  first  500  words 
and  by  units  of  500  in  case  of  the  remaining  words  of  the  first  5,000  and 
perhaps  by  units  of  1,000  words  in  the  case  of  the  second  5,000. 

The  second  line  of  work  that  is  being  undertaken  this  year  by  the 
Bureau  aims  to  determine  better  procedure  in  instruction.  A  somewhat 
elaborate  study  along  this  line  in  its  application  to  the  field  of  silent 
reading  has  been  undertaken  this  year  by  Grover  H.  Alderman  of  the 
faculty  of  the  School  of  Education.  A  detailed  report  of  this  study  will 
be  made  by  Mr.  Alderman  this  afternoon.  The  aim  of  his  study  is  to 
measure  the  relative  values  of  certain  types  of  drill  exercises  for  com- 
prehension in  silent  reading.  Many  of  you  are  familiar  not  only  with 
the  aim  but  with  the  method  of  the  study,  since  your  schools  have  been 
cooperating  in  the  work.  For  the  benefit  of  those  who  are  not  familiar 
with  the  plan,  the  following  detailed  information  is  given  concerning  the 
method  used  in  the  experiment: 

"In  order  to  determine  what  types  of  exercises  to  use,  a  very  care- 
ful and  critical  examination  of  all  the  experimental  studies  now  pub- 
lished was  made  in  order  to  note  the  remedial  exercises  recommended 
by  the  various  authorities.  Most  of  the  exercises  recommended  by  the 
various  authorities  group  themselves  around  a  limited  number  of  out- 
standing types  of  remedial  exercises.  For  this  reason,  the  three  follow- 
ing types  of  exercises  were  chosen  for  experimentation: 



1.  Improving comprehension ability by increasing the child's reading
    vocabulary. This was done by giving synonyms and idiomatic
    expressions plus the proper use of these words in sentences. A
    notebook of all new words acquired by the child was kept.

2.  Improving comprehension ability by means of improving the child's
    ability not only to organize the important thoughts of the
    paragraph around a central unit but also to organize the central
    units of the entire selection in a logical order.

3.  Improving comprehension ability by means of improving the child's
    ability to remember central paragraph thoughts. This was done
    by means of rigorous testing over materials which had been read
    not only that day but material which had been read from two to
    three weeks past.

Practice periods were given 30 minutes each day thruout a period
of six weeks. Comprehension was tested at the end of the third and
sixth week by means of the Thorndike-McCall Silent Reading Scale.

As  a  means  of  determining  whether  speed  might  result  as  a  by- 
product, the  Courtis  Silent  Reading  Test  was  given  at  the  beginning  and 
at  the  end  of  the  experiment  in  the  schools  of  Indianapolis. 

There  were  6  schools,  75  teachers,  and  3,201  pupils  in  grades  from 
4  to  8  cooperating.  Of  these,  1,010  were  drilled  on  Type  1;  1,208  on 
Type 2; 983 on Type 3."

A  second  study  in  methods  of  teaching  is  this  year  being  under- 
taken by  the  Bureau  of  Cooperative  Research  in  the  field  of  primary 
reading.  In  the  practice  school  in  the  grades  maintained  jointly  by  the 
University  and  by  the  public  schools,  some  special  methods  of  presenting 
the  subject  of  primary  reading  are  being  tried  out.  An  effort  was  made 
last  year  and  will  also  probably  be  continued  this  year  to  determine  the 
result  of  these  methods  in  so  far  as  standardized  reading  tests  will 
reveal  such  results. 

A  third  line  of  work  that  the  Bureau  has  been  engaged  in  this  year 
is  in  the  field  of  developing  educational  tests.  One  of  these,  the  Van- 
nest  History  Test,  was  finally  revised  this  year  and  reprinted  in  a  more 
permanent  form  than  formerly.  In  the  revision,  standards  of  achieve- 
ment are  included. 

A  second  standardized  test  which  the  Bureau  has  been  working  on 
this  year  is  the  development  of  a  measuring  scale  in  foods.  The  initia- 
tive in  this  study  was  taken  by  Miss  Florance  Beeson  King,  of  the  De- 
partment of  Home  Economics,  and  was  reported  on  a  year  ago  at  the 
educational  measurements  conference.  A  brief  summary  of  this  study 
is  as  follows : 

"Tests  were  started  in  the  fall  of  1022,  at  which  time  there  were 
no  tests  in  foods  published. 

Purpose:  1.  To  suggest  the  broader  and  more  desirable  subject- 
matter  which  should  be  included  in  foods  courses  and  to  standardize  this 
material  to  some  extent. 



2.  To  set  standards  of  attainments  to  make  possible  the  comparison 
of  individual  pupils,  sections,  or  school  systems  in  their  food  work. 

3.  To  enable  the  teacher  to  measure  the  efficiency  of  her  teaching. 

4.  To  enlighten  the  teacher  as  to  the  pupil's  ability  to  handle  sub- 
ject-matter  and   their   difficulties. 

Method of Procedure: 1. To find subject-matter for tests.

a.  Analyzed  for  content  six  of  the  nine  textbooks  which  were 
found  by  the  University  of  Chicago  survey  to  be  used  by  90  per  cent  of 
teachers  of  the  United  States  and  three  others  published  since  then. 

b.  Consulted  articles  on  home  economics  in  various  educational 
magazines. 

c.  Consulted  a  number  of  courses  of  study. 

d.  Discussed  this  subject-matter  with  several  teachers  of  foods  in 
elementary  and  secondary  schools,  several  university  instructors  in  foods, 
and  one  in  vocational  home-making. 

2.  Arrangement   of   subject-matter. 

a.  Design  of  test.  Simple  checking  test,  with  method  of  right  or 
wrong  scoring,  covering  material  involving  information  and  judgment. 

3.  Preliminary  test. 

a.  Given  to  group  of  instructors.  All  those  which  were  ambiguous 
or  in  which  there  was  any  difference  of  opinion  thrown  out. 

b.  Given  to  100  pupils  of  a  junior  and  senior  high  school.  All 
found  to  be  unscorable  or  otherwise  unsatisfactory  were  eliminated.  Ar- 
ranged in  order  of  increasing  difficulty  for  that  group. 

c.  Given  to  a  group  of  50  pupils,  25  answering  one  set  arranged  in 
order  of  increasing  difficulty,  25  answering  a  set  arranged  in  increasing 
difficulty  within  associated  groups.     First  set  most  satisfactory. 

d.  Given  with  above  arrangement  to  about  5,000  pupils  from  sixth 
to  twelfth  grades  in  Illinois,  Nebraska,  Ohio,  Michigan,  Arkansas,  and 
Indiana. 

Data  from  (d)  are  now  being  compiled.  They  will  be  used  to  give 
a  score  value  to  each  exercise,  so  that  they  may  be  arranged  into  a 
tentative  scale. 

We also hope to form tentative standards of attainment for 7A and
7B  and  8A  and  8B." 
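
The arrangement of exercises "in order of increasing difficulty" described in the summary above can be sketched as follows. This is an illustration only, under two assumptions not stated in the source: that difficulty is measured by the share of pupils marking an exercise wrong, and that each exercise is scored simply right (1) or wrong (0). The exercise names and marks are invented for the example.

```python
def order_by_difficulty(responses):
    """responses maps an exercise name to a list of marks, 1 for right
    and 0 for wrong. Returns the exercise names sorted from easiest to
    hardest, taking difficulty as the share of pupils marking it wrong."""
    def share_wrong(exercise):
        marks = responses[exercise]
        return 1 - sum(marks) / len(marks)
    return sorted(responses, key=share_wrong)

# Hypothetical marks for three exercises answered by five pupils.
sample = {
    "exercise_a": [1, 1, 0, 0, 0],  # 3 of 5 wrong
    "exercise_b": [1, 1, 1, 1, 0],  # 1 of 5 wrong
    "exercise_c": [1, 1, 1, 0, 0],  # 2 of 5 wrong
}

ordered = order_by_difficulty(sample)
print(ordered)  # ['exercise_b', 'exercise_c', 'exercise_a']
```

The same share-wrong figure could serve as a starting point for the score value to be attached to each exercise, though the source does not say how those values are to be derived.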

The  beginning  steps  were  also  taken,  looking  toward  the  develop- 
ment of  a  scale  of  measurement  in  the  field  of  mechanical  drawing  since 
this  subject  is  offered  in  the  secondary  schools  in  the  state  of  Indiana. 
A  secondary  purpose  in  this  study  is  to  investigate  the  subject-matter 
in  the  mechanical  drawing  curricula  in  Indiana  and  to  determine  the 
methods  that  are  used  in  teaching  the  subject  in  the  high  schools  of  the 
state.  On  account  of  the  absence  of  Mr.  Treat,  who  had  this  study  in 
charge,  little  progress  has  been  made  on  it  during  the  current  year. 

Miss  Florence  Blazier,  of  the  Department  of  Home  Economics,  of  In- 
diana University,  has  undertaken  a  research  study  for  the  Bureau  of 
Cooperative  Research  in  a  fourth  field,  namely,  that  of  nursing.  This 
study  has  been  undertaken  with  the  purpose  in  mind  of  analyzing  this 
field  as  an  occupation  for  girls. 



The  situation  has  been  studied  with  a  view  to  furnishing  informa- 
tion to  two  groups  of  people:  (1)  the  vocational  counsellor  who  wishes 
to  advise  girls  as  to  their  future  work;  (2)  the  high  school  girl  who  is 
contemplating  entering  a  school  for  nurses. 

The  method  of  procedure  has  been  as  follows: 

1.  A  critical  study  of  all  available  literature  on  the  subject  of 
nursing  education. 

2.  A  study  of  the  extent  and  nature  of  the  field,  by  visiting  typical 
hospitals,  training  schools,  public  health  centers,  etc. 

3.  Study of the results of a questionnaire sent to 1,100 graduate
nurses  in  Indiana,  who  are  engaged  in  all  types  of  nursing,  such  as 
private  duty,  institutional,  and  public  health. 

The  questionnaire  deals  with  such  points  as  educational  qualifica- 
tions, including  graduate  study,  age,  salaries,  with  reasons  for  unem- 
ployment and  statements  of  opinions  as  to  the  physical  and  personal 
qualities  necessary  for  success. 

4.  Study  of  the  judgments  of  leaders  in  the  field  of  nursing  who 
have  been  consulted  in  evaluating  work  of  certain  items  in  the  ques- 
tionnaire. 

In  carrying  on  this  study,  it  has  been  very  gratifying  to  find  that 
superintendents  of  nurses  in  some  of  the  largest  hospitals,  heads  of 
visiting  nurse  associations,  and  state  nursing  officials  have  granted  us 
their  fullest  cooperation. 

This  year  completes  the  gathering  of  data  for  Professor  W.  W. 
Black's  study  in  Handwriting,  carried  on  under  the  auspices  of  the  Bu- 
reau of  Cooperative  Research. 

In May, 1917, writing samples from the 2A grades of 40 Indiana
cities  were  sent  to  the  Bureau.  Since  that  date,  samples  have  been  re- 
ceived each  year  from  all  or  nearly  all  of  the  same  cities.  The  Bureau 
now  has  samples  from  grades  2A  to  7A.  This  year  samples  will  be 
collected  from  the  8A  grades.  It  is  believed  that  when  this  material  is 
worked  up  it  should  furnish  valuable  information  on  a  number  of  points. 

1.  A  study  of  the  handwriting  of  a  large  number  of  individual 
pupils  year  by  year  thru  the  seven  years  should  give  valuable  informa- 
tion on  pupils'  rate  of  progress  in  quality  and  speed  and  the  points  of 
acceleration  and  slump,  if  any. 

2.  A  study  will  be  made  of  the  question  of  when  to  discontinue 
formal  lessons  in  writing. 

3.  It is the intention to make a comparative study of the handwriting
of pupils from cities which employ supervisors or special teachers
of  writing  with  those  from  cities  in  which  writing  is  taught  by  the  reg- 
ular room  teacher. 

4.  If  one  or  a  few  of  the  cities  should  stand  far  above  the  others, 
a  study  will  be  made  of  the  plan  and  method  used  in  the  schools  of  these 
cities. 

5.  The study should give at least a fair estimate of how well writing
is taught in Indiana cities in comparison with the standards set for
the whole country.

A  sixth  line  of  work  which  the  Bureau  is  undertaking  this  year  is 
that of assisting individuals and organizations in the state who wish
help along various lines in which the Bureau is especially prepared to
render  help.  Several  superintendents,  during  the  year,  have  written  in 
to  the  Bureau  for  bibliographies  or  for  information  concerning  stand- 
ardized tests  in  the  fields  of  measurement  of  achievement  and  measure- 
ment of  intelligence.  We  are  engaged  at  the  present  time  in  working  up 
a  bibliography  of  school  surveys.  This  is  being  developed  in  its  present 
form  at  the  request  of  one  of  the  libraries  in  the  state.  We  hope  later  to 
get  it  in  shape  for  publication  and  general  distribution. 

A  third  method  of  cooperation  with  individuals  and  institutions  out 
in  the  state  as  carried  on  this  year  is  illustrated  by  the  cooperative 
study  that  Indiana  University  is  making  for  the  State  Department  of 
Public  Instruction.  This  is  a  study  of  the  actual  combinations  of  sub- 
jects that  high  school  teachers  are  this  year  teaching  in  our  own  state. 
The  purpose  of  the  study  is  that  it  may  serve  as  a  basis  for  some  of 
the  constructive  recommendations  that  the  State  Department  of  Public 
Instruction  will  make  to  the  State  Board  of  Education  touching  the  ad- 
ministration of  the  new  teachers'  license  law.  It  was  thought  that  defi- 
nite concrete  information  as  to  the  actual  present-day  demands  in  our 
state  along  these  lines  would  be  suggestive  in  the  development  of  the 
State  Board  of  Education  policy  of  licensing  high  school  teachers  in  the 
future.     It  is  hoped  that  this  brief  study  may  be  published  soon. 

Another  field  in  which  the  Bureau  has  been  actively  engaged  this 
year  is  that  of  giving  mental  tests.  Mr.  Z.  M.  Smith,  State  Director  of 
Vocational  Education  in  Indiana,  is  conducting  this  study  under  the 
direction  of  the  Bureau  of  Cooperative  Research.  The  Indiana  Univer- 
sity mental  survey  test  has  been  given  in  connection  with  this  study  to 
1,500  boys  in  vocational  agricultural  schools  in  Indiana  and  to  an  equal 
number  of  non-vocational  boys  in  the  same  schools  and  of  the  same 
school  grades  as  the  vocational  boys.  The  same  test  has  been  given  also 
in  the  states  of  Ohio,  New  York,  Minnesota,  South  Dakota,  California, 
Iowa,  Illinois,  and  Kansas.  The  plan  was  to  give  the  test  to  a  minimum 
of  100  boys  in  vocational  agricultural  schools  in  each  of  these  states 
and  to  an  equal  number  of  non-vocational  boys  in  these  states.  The 
idea  is  to  use  these  results  in  an  effort  to  determine  any  group  differ- 
ences in  mental  ability  as  far  as  this  test  is  able  to  reveal  differences 
in  native  capacity.  This  study  will  ultimately  serve  as  a  portion  of  a 
larger  study  being  undertaken  by  Mr.  Smith  entitled  "The  Work  of  the 
Teacher  of  Vocational  Agriculture".  This  topic  he  hopes  to  complete 
in  partial  fulfilment  of  the  requirements  for  the  degree  of  doctor  of 
philosophy. 

A  further  service  that  the  Bureau  of  Cooperative  Research  has  con- 
tinued to  render  this  year  as  it  had  been  doing  in  previous  years  is  to 
prepare  and  keep  available  an  exhibit  of  educational  tests.  We  now  have 
282  tests. 

The  most  ambitious  study  that  the  Bureau  has  undertaken  this  year 
is  that  of  compiling  a  bibliography  of  educational  tests.  This  has  been 
a  tedious  and  a  long-drawn-out  task.  It  seemed,  however,  that  there 
was need for such a study. I have understood only recently that a somewhat
similar study is being undertaken by the Bureau of Cooperative
Research at Ohio State University for the National Bureau of Education
at  Washington,  D.C.  Our  publication,  however,  is  intended  to  be  a  little 
more  elaborate  than  the  proposed  bibliography  to  be  published  by  the 
Bureau  of  Education.  We  have  the  material  collected  and  the  data  in 
all  cases  will  cover  the  following:  (1)  title  of  test,  (2)  author  of  the 
test,  (3)  date  of  publication  of  the  test,  (4)  purpose  of  the  test,  (5) 
range  of  grades  covered  by  the  test,  (6)  time  necessary  to  give  the  test, 
(7)  time  necessary  to  score  the  test,  (8)  a  statement  as  to  whether  the 
test  has  been  standardized  or  not  and  if  so,  for  what  time  of  the  year 
the  standards  apply,  (9)  a  brief  description  of  the  test.  This  descrip- 
tion is  intended  to  give  a  fairly  adequate  idea  of  the  test — for  identifi- 
cation purposes  and  enough  to  serve  as  a  basis  for  a  general  notion  of 
the  test. 
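
The nine items recorded for each test can be pictured as one record per test. The sketch below is a hypothetical layout, not the Bureau's actual card or manuscript form: the class name, the field names, and the sample values are all invented for illustration, and `None` is used here to stand for an item not yet on record.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class BibliographyEntry:
    """One test in the bibliography, following items (1) to (9) above."""
    title: str                              # (1) title of test
    author: str                             # (2) author of the test
    publication_year: Optional[int]         # (3) date of publication
    purpose: str                            # (4) purpose of the test
    grade_range: str                        # (5) range of grades covered
    minutes_to_give: Optional[int]          # (6) time necessary to give
    minutes_to_score: Optional[int]         # (7) time necessary to score
    standardized: bool                      # (8) standardized or not
    standards_time_of_year: Optional[str]   # (8) season the standards apply to
    description: str                        # (9) brief description

# A made-up entry, for illustration only; it describes no real test.
example = BibliographyEntry(
    title="Example Arithmetic Scale",
    author="A. N. Author",
    publication_year=None,
    purpose="Measure speed and accuracy in the fundamental operations",
    grade_range="4-8",
    minutes_to_give=20,
    minutes_to_score=5,
    standardized=True,
    standards_time_of_year="May",
    description="Hypothetical placeholder entry.",
)
print(example.title, example.grade_range)
```

Keeping the publication date optional reflects the fact that some tests are listed before their date of publication is known.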

Following  this  description  is  a  statement  as  to  the  publishers  of  the 
test,  together  with  the  price  at  which  the  test  is  quoted.  Thus  far  we 
have  listed  in  this  bibliography  301  different  educational  tests.  A  chart 
has  been  drawn  showing  the  distribution  of  these  tests  according  to  the 
years  in  which  they  were  published.  The  publication  of  standardized 
tests  and  scales  of  measurement  started  in  this  country  in  1908  and 
gradually  increased  up  thru  the  year  1913,  after  which  there  was  a 
very  rapid  and  continuous  increase.  Our  records  show  that  37  different 
new  standardized  achievement  tests  were  published  in  1922.  Our  record 
also  indicates  that  thus  far  in  the  year  1923  eight  such  tests  have  come 
from  the  press.  These  numbers  relating  to  the  number  of  tests  pub- 
lished in  the  respective  years  do  not  include  a  list  of  65  tests  listed  as 
miscellaneous and concerning which the date of publication is not yet
available  in  our  records. 

Another  graph  of  some  interest  is  made  in  the  form  of  a  tree  with 
its  branches  [shown  with  the  lecture].  Each  branch  represents  the  date 
of  the  first  test  in  a  particular  field;  thus,  there  is  one  branch  to  indi- 
cate the  beginning  of  the  test  movement  in  arithmetic,  another  branch 
to  reveal  similar  information  for  Latin,  another  one  for  language,  etc. 
This  graph  reveals  the  various  subjects  of  the  curriculum  that  the  meas- 
urement movement  has  invaded  together  with  the  dates  when  tests  first 
appeared  in  these  subjects,  as  follows: 

Arithmetic            1908      Grammar            1918
Handwriting           1910      Geometry           1918
Composition           1912      Biology            1918
Drawing               1913      Chemistry          1918
Physical Training     1913      Home Economics     1919
Reading, Spelling     1914      Vocational         1919
Geography             1914      Botany             1919
History               1914      General Science    1919
Algebra               1914      Spanish            1919
English Vocabulary    1915      French             1919
Latin                 1915      Journalism         1920
Physics               1916      Music              1920
Language              1917      Poetry Judging     1921
German                1917      Physiology         1921




According to subjects, these 301 tests distribute themselves as follows:

Reading               40      Physical Training      5
Arithmetic            30      Vocabulary             5
Vocational            24      Drawing                4
Handwriting           24      French                 4
Composition           20      General Science        4
History               19      Chemistry              3
Spelling              16      Geometry               3
Geography             14      Music                  3
Rating                13      Botany                 2
Algebra               10      German                 2
Latin                 10      Spanish                2
Miscellaneous         10      Biology                1
Grammar                9      Journalism             1
Language               8      Modern Language        1
Home Economics         6      Physiology             1
Physics                6      Poetry Judging         1


There  is  on  exhibit  on  the  table  at  the  entrance  of  the  auditorium 
a  typewritten  copy  of  the  material  for  this  bibliography  in  so  far  as  we 
have  gathered  the  material.  Any  visitors  are  free  to  examine  this  man- 
uscript for  information  concerning  any  line  of  tests  in  which  they  are 
interested.  It  is  our  hope  that  this  material  will  be  available  in  printed 
form  by  the  opening  of  school  this  coming  fall  if  not  at  an  earlier  date. 


The Effect of Certain Kinds of Drill Exercises
on Comprehension


Grover H. Alderman, Associate Professor of Elementary Education,
Indiana University


I.     INTRODUCTION 

My  purpose  for  appearing  before  you  this  afternoon  is  to  report  to 
this  conference  the  results  of  a  study  in  silent  reading  which  has  been 
carried  on  in  various  schools  of  the  state  of  Indiana  under  the  direction 
of  our  Bureau  of  Cooperative  Research. 

For  some  time  we  have  felt  that  one  of  the  big  problems  with  which 
the  average  school  superintendent  or  supervisor  is  confronted  is  the 
selection  of  proper  drill  exercises  in  silent  reading  which  will  prove 
beneficial when administered by practical teachers under normal
classroom conditions. It is true that during the last five years there has
been published a large number of scientific investigations in the field of
silent reading, but a critical examination of these studies reveals the fact that
while  they  are  of  great  value  to  the  one  interested  in  research,  they  are, 
many  times,  of  little  value  to  the  busy  superintendent  in  the  field. 

This  is  true,  not  because  these  investigations  are  not  scientifically 
compiled,  but  because  the  data  are  obtained  many  times  as  the  result 
of  intensive  training  under  the  direction  of  an  expert,  surrounded  by 
ideal conditions. Before the superintendent can recommend them,
therefore, he must first make sure that the investigator had the right
objectives in mind, and, second, that the data were not obtained at the
expense  of  some  other  phase  of  reading  which  is  of  equal  importance. 
My  contention  on  this  point  can  well  be  illustrated  in  the  case  of  speed 
in  reading.  Peters  (8)  in  1915  drilled  pupils  in  grades  three  to  six  for 
a  period  of  seven  months.  At  the  end  of  this  period  he  reported  a  gain 
of  18.7  per  cent,  but  with  a  loss  in  comprehension.  C.  T.  Gray  (2)  two 
years  later  reports  that  in  the  case  of  a  single  fifth-grade  pupil  it  was 
possible  to  make  a  gain  as  high  as  217  per  cent.  In  commenting  on  the 
experiment, however, Gray says "Speed was the aim here at the
expense of comprehension if necessary."

We believe with Thorndike (10) when he says "No mental function
has been deliberately practiced with an eye to improving it and with
proper opportunity for the law of effect to operate without some
improvement as a result", but, nevertheless, the average school
superintendent is not contented with simply telling his teachers to teach
silent reading for a certain period each day, without telling them why and
how it should be done. The data, therefore, which will be of most value
to the superintendent not only tell him why and how but also how much.

One of the most helpful experiments in the entire field of reading
was performed by O'Brien (7) in 1920 in the schools of Illinois. In this
experiment O'Brien used 40 grades of 20 different schools, in 9 different
school systems. His aim was to determine how much improvement in
speed could be made without any loss in comprehension. After two
months of drill, O'Brien presented data to prove that the average
teacher under careful supervision should be able to obtain a gain in
speed of 31 per cent and at the same time obtain, as a by-product, an
improvement in comprehension ability of 1.6 per cent. By using three
kinds of exercises, O'Brien was able to tell the effectiveness of each.
Not only this, but he showed that the possibility of improvement in rate
did not cease with the sixth grade, as had previously been reported by
Gray (3) in the Eighteenth Year-Book, but that under normal
conditions the greatest possibility for improvement occurred in the eighth
grade.

With the hope of performing an experiment in comprehension which
would prove as helpful to the teacher as did O'Brien's study in speed,
the speaker early in the year undertook the task of selecting three kinds
of drill exercises to use in the schools of the state for the purpose of
improving comprehension ability in silent reading. Two methods of
selecting these exercises seemed plausible. These methods, for want of a
better name, we shall call the negative and the positive method. The
negative method is that method in which a large number of children
are asked to read a difficult selection and then answer a number of
questions over the selection which they have read. Their answers, when
closely analyzed, will indicate the causes for their low scores in
comprehension. This method was used by Thorndike (10) in his study of
paragraphs and sentences. By examining the numerous answers given, he
concluded that the reasons so many wrong answers were given are:
(1) wrong connections with words singly, (2) overpotency or
underpotency of the elements, (3) failure to treat the ideas produced by
reading as provisional until they are accepted or rejected.

While this method has its merits, the speaker realized that should
he attempt to follow it, his investigation would require that the papers
be scored by a number of judges in order to eliminate the personal
judgment of the speaker in scoring. Also, if this method were followed, he
would not be able to utilize the large number of scientific investigations
which have already been made in the field of silent reading, especially
those which have to do with comprehension. For this reason it was
deemed wise to make a very thoro examination of all the scientific
studies in the field of silent reading which gave emphasis to the subject
of comprehension, in order to learn whether there had been any
investigations reported which might lend themselves to further
experimentation.

1. Need for Vocabulary Training. One does not have to delve very
deeply into the literature of silent reading before he is impressed with
the fact that, altho approaching it from different angles, all investigators
agree that one of the chief causes for low scores in comprehension is the
lack of an adequate vocabulary. Thorndike was the first to point this
out in his article "Reading as Reasoning" (11). Gray also emphasizes
it in his article in the Twentieth Year-Book (4) in his description of a
seventh-grade boy who ranked low in comprehension because of a limited
vocabulary. Of the results of six weeks of training on vocabulary, in
which words were studied in context for meaning, with special emphasis
placed on prefix, suffix, and stem in the case of the most difficult words,
Gray says, "Suffice it to say that the special training given in this
connection resulted in a 50 per cent deduction in error and a gain in rate."
In comparing the progress of this boy with that of a boy who did not
receive the special training, Gray says, "In comprehension, the subject
of the experiment made far greater progress than the check pupil,
altho at the close of the experiment he was still behind the standard for
the class."

Pressey and Pressey (9) in making a rather critical study of the
concept of silent reading selected a large reading vocabulary as one of
the four factors needed in the reformation of silent reading problems.
They say: "Evidently a reading vocabulary is a factor most readily
acquired by reading. Measurement of vocabulary the writers feel should
be a regular practice in appraising the work of a teacher or a school."
References like the one just cited, with many others of equal importance,
lead one to believe that one of the most important exercises for
increasing comprehension ability is one which has for its aim the
increasing of the child's reading and understanding vocabulary.

2. Need for Training in Organization. A second type of difficulty
which is frequently mentioned by writers dealing with the subject of
low scores in comprehension is the lack of power to select central
paragraph thoughts and organize them in a logical manner. Most writers
fail to recognize the fact that in order to organize properly one must
have the proper elements to organize. That children do not possess the
ability to select central paragraph thoughts is pointed out by both
Germane (1) and Woody (12). Woody gave his silent reading test to
1.2  1)  children  for  the  purpose  of  seeing  how  well  children  were  able  to 
pick  out  central  paragraph  thoughts.  Woody  says,  "Out  of  the  nine 
exercises in the test the median number correctly marked varied from
2.6 in the fourth grade to 5.5 in the eighth grade. This study
substantiated Germane's study that 'most pupils in grades five to nine do not
know how to summarize the material read but devote most of their time
to indiscriminate note taking.'"

Regarding the ability of pupils properly to organize central thoughts
in a logical manner Thorndike (11) says "Understanding a spoken or
written paragraph is then a matter of habit connecting mental bonds,
but these have to be selected from so many others, and given relative
weights delicately, and used together in so elaborate an organization,
that 'to read' means 'to think' as truly as it does 'to evaluate' or 'to
verify'." Such evidences as those cited by Germane, Woody, and
Thorndike led the speaker to select for his second type of exercise some
drill work which tended to train the child, first, to select the central
paragraph thought, and second, to organize these thoughts in a logical and
an orderly manner.

3. Need for Training in Retention and Reproduction. Third and
last, the speaker found that many writers considered the ability to
retain and reproduce what had been read a vital factor in successful
silent reading. Horn (6) considers that rate, comprehension,
organization, and retention are practically on a par so far as successful
silent reading is concerned. He points out the fact that special drill
work should be devised in order to make the child more proficient in
each ability.

One  of  the  most  elaborate  experiments  in  retention  which  has  been 
performed in recent years is reported by Yoakam (13) in the Twentieth
Year-Book.  In  this  article  Yoakam  gives  data  to  prove  that  the  ability 
of  the  average  child  to  reproduce  that  which  he  has  read  varies  from 
7.5  per  cent  in  grade  four  to  20.9  per  cent  in  the  eighth  grade.  In  other 
words,  the  average  child  in  the  elementary  school,  in  the  language  of 
central  tendencies,  is  only  able  to  reproduce  approximately  10  per  cent 
of  the  factual  material  as  a  result  of  a  single  reading.  An  exercise  which 
aimed  to  increase  this  ability  was  selected  for  the  third  type  of  drill 
work. 

A report of the principal causes underlying low scores in
comprehension would not be complete without a brief report of an unpublished
study just completed by Dr. George Hilliard (5) of the Western State
Normal School at Kalamazoo, Mich. After making a thoro survey of
all the educational literature written in the field of silent reading,
Hilliard lists six causes for low scores in comprehension tests. These six
are: (1) low general intelligence, (2) insufficient vocabulary, (3) lack
of organization ability, (4) faulty rate of reading, (5) inability to
reproduce, (6) excessive lip movement and articulation. In order to determine
the correlation between these six factors and comprehension ability,
Hilliard measured the comprehension ability of 161 fifth-grade pupils by
the Monroe, the Burgess, the Courtis, and the Thorndike-McCall scales
and correlated these results with (1) intelligence as measured by the
Terman Group Test, (2) vocabulary as measured by the Thorndike
Visual Vocabulary, (3) organization as measured by Greene's
Organization Test, (4) rate as measured by the Courtis tests, and
(5) reproduction as measured by the Dutch Homestead Test. In commenting
on his results Hilliard says, "Comparisons between the scores on
comprehension tests and measures of the six factors given above show a
positive relationship between them all." The zero-order correlations give
the order of importance of these six factors as follows: (1) intelligence,
(2) vocabulary, (3) organization, (4) rate, (5) reproduction, (6) lip
movement.


Table I.
(After Hilliard)

                                  Monroe     T. McC.    Burgess    Courtis
Type of Tests                     r   P.E.   r   P.E.   r   P.E.   r   P.E.

I.   Intelligence
       Terman Group
II.  Vocabulary
       Thorndike, Visual Vocabulary
III. Organization
       Greene
IV.  Rate
       Courtis
V.   Reproduction
       Dutch Homestead

[Coefficients and probable errors illegible in this copy.]

Note: The number of cases (25) is so small that unless the correlation is five or six
times the probable error it has little significance.
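
The rule of thumb in the note can be illustrated with a short calculation. The sketch below assumes the conventional formula of the period for the probable error of a correlation coefficient, P.E. = 0.6745(1 - r^2)/sqrt(n); the bulletin does not state which formula Hilliard used, and the function names and the two sample coefficients are hypothetical, not values taken from Table I.

```python
import math


def probable_error(r, n):
    """Conventional probable error of a correlation r based on n cases (assumed formula)."""
    return 0.6745 * (1 - r ** 2) / math.sqrt(n)


def is_dependable(r, n, multiple=5):
    """Rule of thumb from the note: r should be at least `multiple` times its P.E."""
    return abs(r) >= multiple * probable_error(r, n)


# Hypothetical coefficients for 25 cases; these are NOT values from Table I.
for r in (0.30, 0.60):
    pe = probable_error(r, 25)
    print(f"r = {r:.2f}  P.E. = {pe:.3f}  r / P.E. = {r / pe:.1f}  "
          f"dependable: {is_dependable(r, 25)}")
```

With only 25 cases, a coefficient of moderate size falls short of five times its probable error, which is the caution the note expresses.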


Since  all  will  agree  that  native  intelligence  does  not  improve  with 
training,  the  three  kinds  of  drill  work  selected  by  the  speaker  coincide 
with  the  ones  selected  by  Hilliard  with  the  exception  of  rate.  While 
rate was recognized by us as an important factor in improving
comprehension ability, this element was purposely omitted since we felt
that  the  relation  of  rate  to  comprehension  had  been  so  well  solved  by 
O'Brien. 

The  three  types  of  exercises  finally  selected  for  experimentation  are: 

Type I.   Those which aim to enlarge the child's reading and
          understanding vocabulary.

Type II.  Those which aim to improve the child's ability in selecting
          central paragraph thoughts and organizing them in a
          logical manner.

Type III. Those which aim to improve the child's ability to retain and
          reproduce the important points which he has read.

II.     THE    PROBLEM 

The problem, therefore, which was finally selected for
experimentation is briefly stated as follows:

1. Under normal classroom conditions is it possible to improve
comprehension ability by drilling intensively over a period of six weeks?

2.  If  comprehension  ability  can  be  improved,  which  kind  of  drill 
exercise  is  the  most  effective? 


III.     METHOD

In order that the experiment might be carried on under normal
classroom conditions, several schools of the state were invited to
cooperate. The response to the invitation sent out by Dean Smith was
so cordial that more schools responded than we were able to use, since
the expense incurred in conducting such an experiment made it
prohibitive to use more cases than were necessary. On the other
hand, so many unforeseen conditions always arise that it is
necessary to include a large number of cases to offset any such
variable. In all, 3,201 pupils and 75 teachers in grades four to
eight in the cities of East Chicago, Frankfort, Lebanon, Mishawaka,
Huntington, and Indianapolis cooperated in this experiment. Of this
number, 1,010 children under the direction of 25 teachers were used in
the vocabulary experiment, 1,208 pupils under the direction of 25
teachers were used in the organization experiment, and 983 children
with 25 teachers were used in the retention experiment. The unequal
number of children in the various types is due to the fact that each
school system worked on all three types of exercises. This was done to
eliminate any variable which might arise as a result of better teaching
or better supervision which might occur in certain cities.

The  instructions  sent  to  the  teachers  were  carefully  explained  in 
every  detail,  covering  thirteen  pages  of  closely  typed  material.  The  aim 
of  the  experiment,  the  method  of  dividing  the  class,  the  method  of 
teaching, and the methods of administering the tests were carefully
explained. While time does not permit me to explain in detail the various
instructions sent to the teachers, it does seem necessary to explain
somewhat briefly the chief points of interest.

The  Thorndike-McCall  Reading  Scale,  Form  I,  was  used  to  divide 
each  class  into  a  drill  and  a  non-drill  section.  The  teacher  was  advised 
to divide the class into two divisions of equal reading ability as
indicated by the T. scores on the Thorndike-McCall Scale.
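
One way such a division might be carried out is sketched below. The bulletin does not say how the teachers actually matched the two sections, so the procedure shown (rank the pupils by T score and deal them out in an A-B-B-A pattern) is an assumption, and the function name, pupil labels, and scores are hypothetical.

```python
from statistics import mean


def split_matched(scores):
    """Split pupils into two sections with nearly equal mean T scores.

    scores: dict mapping pupil -> T score on Form I.
    Returns (drill, non_drill) as lists of pupils.
    """
    ranked = sorted(scores, key=scores.get, reverse=True)
    drill, non_drill = [], []
    for i, pupil in enumerate(ranked):
        # A-B-B-A dealing: the 1st and 4th pupil of each block of four go to drill.
        if i % 4 in (0, 3):
            drill.append(pupil)
        else:
            non_drill.append(pupil)
    return drill, non_drill


# Hypothetical class of eight pupils and their T scores.
class_scores = {"A": 58, "B": 55, "C": 52, "D": 50,
                "E": 47, "F": 45, "G": 42, "H": 39}
drill, non_drill = split_matched(class_scores)
print(drill, mean(class_scores[p] for p in drill))
print(non_drill, mean(class_scores[p] for p in non_drill))
```

Dealing in this pattern keeps both the number of pupils and the average score of the two sections close, which is all the instruction to the teachers requires.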

After the division had been made, the teacher was directed to
conduct the non-drill group by the same method she had always used with
the exception that the recitation in all cases was to be but thirty minutes
in length. It was also suggested that she keep the drill and the
non-drill group separated during the recitation period with the exception of
those  days  when  the  Thorndike-McCall  tests  were  to  be  given.  The 
teachers,  as  a  rule,  found  it  impossible  to  comply  with  this  request  since 
neither  the  extra  room  nor  the  extra  teacher  was  available. 

While no conferences were held nor demonstration lessons taught,
as was done by O'Brien in Illinois, each teacher was furnished a
five-page mimeographed pamphlet which explained in detail the method to
be followed in conducting the drill class. While the instruction varied
with each type of exercise, there were a few points in common which
applied to all three types. First, it was explained to the teacher that
the reading material to be used by the drill group should always be of
a concrete nature, well organized and easily tested. It was suggested
that the teachers use selections on citizenship, health, commerce,
geography, history, etc. In order that the material would present a real
problem in comprehension to the child, it was suggested that the
selections be taken from the grade above. It was emphasized that neither
fairy tales nor poetry was to be used by the drill group.

The  second  instruction  which  was  common  to  all  grades  had  to  do 
with  the  method  of  conducting  the  first  day's  recitation.  During  the 
first  half  of  the  first  day's  lesson  each  teacher  was  asked  to  explain  to 
her  class  the  aim  of  the  experiment.  She  was  also  asked  to  prepare  the 
minds of the children by explaining to them the importance of
comprehension, the relation of comprehension to study habits, and the
possibility of improving it. It was further explained that there were 75
different  classes  in  the  experiment,  and  that  this  particular  class  must  do 
its  best  since  the  results  were  to  be  published. 

The  second  half  of  the  first  lesson  was  used  to  introduce  the  child 
to  the  type  of  drill  work  he  was  to  use.  In  the  case  of  Type  I,  the 
vocabulary  study,  the  child  was  asked  to  read  silently  a  rather  difficult 
selection  which  was  taken  from  the  grade  above.  After  the  class  had 
finished  reading  the  selection  once,  some  child  was  asked  to  tell  in  his 
own  words  the  important  thoughts  of  the  selection.  As  soon  as  the 
teacher  discovered  the  child's  inability  to  do  this,  she  pointed  out  to  him 
that  his  inability  to  select  central  paragraph  thoughts  was  largely  due 
to  the  fact  that  he  had  failed  to  understand  the  meaning  of  certain 
fundamental  words.  The  teacher  then  proceeded  to  explain  that  during 
the  next  six  weeks  intensive  drill  work  would  be  given  for  the  purpose 
of  building  up  each  child's  reading  vocabulary. 

The method used for the vocabulary study was similar to that
reported by Gray (4) in the Twentieth Year-Book of the National Society
for the Study of Education. The method in brief is as follows: As soon
as the child encountered a word that he did not understand, he looked it
up in the dictionary, not for the purpose of learning its definition but
rather for the purpose of learning several synonyms and idiomatic
expressions which explain its meaning. The word with several synonyms and
equivalent  idiomatic  expressions  and  an  illustrative  sentence  were  then 
recorded  in  the  child's  notebook  which  he  was  advised  to  keep  from  day 
to  day.  For  example,  instead  of  writing  the  definition  of  the  word  "in- 
domitable", the  child  would  simply  record  "indomitable — meaning  brave, 
fearless — idiomatic  expression,  to  look  danger  in  the  face — Sentence,  He 
was an indomitable hero."

The  method  used  in  Type  II  was  similar  to  that  used  in  Type  I  in 
that  the  first  part  of  the  first  lesson  was  used  in  explaining  the  aim  of 
the  experiment,  the  method  of  approach,  etc.  In  Type  II,  however,  the 
pupil was led to see that one of the chief causes for low scores in
comprehension is the fact that children are not trained to pick out
the  important  thoughts  of  a  selection  and  organize  them  in  a  logical 
way.  It  was  further  explained  that  during  the  next  six  weeks  the  class 
would  be  drilled  for  thirty  minutes  each  day,  to  see  if  it  would  not  be 
possible  to  improve  each  child's  ability  in  selecting  central  paragraph 
thoughts  and  organizing  them  in  an  orderly  and  a  logical  manner. 

The method used in teaching organization was briefly as follows:
The children were asked to read the selection rapidly. After the reading
was completed, the important thoughts of each paragraph were then
selected and written on the board. The children were then permitted to
read the selection the second time to correct any wrong impressions
which they had gained as a result of the first reading. After this had
been done, the children in cooperation with the teacher organized these
thoughts in their proper relationship, in the form of a brief, concise
outline. These outlines when completed were placed in the children's
notebooks in order that they might be used for future review work.

The  third  type  of  exercise  differed  from  Type  II  in  that  the  aim  here 
was  to  improve  comprehension  ability  by  means  of  constant  drill  work 
in  the  reproduction  of  important  facts  taken  from  the  reading  lesson. 
The  instructions  in  Type  III  provided  for  the  following:  The  teachers 
were  requested  to  ask  each  pupil  to  read  his  lesson  rapidly,  keeping  in 
mind that he was soon to be submitted to a short, rigorous fact
examination. As soon as the class had read the selection once, ten short
fact  questions  were  given  which  could  be  answered  in  one  word.  After 
the  test  was  completed  the  papers  were  exchanged  and  corrections  made. 
After  the  papers  had  been  corrected  and  passed  back  to  the  original 
owners,  the  class  was  directed  to  read  the  lesson  the  second  time,  with 
the  hope  of  finding  the  correct  answer  to  each  question.  In  order  to 
facilitate  matters,  the  teacher  was  requested  to  have  the  ten  questions 
written  on  the  blackboard  under  a  map  so  that  the  questions  might  be 
plainly  seen  by  the  class  when  the  map  was  removed. 

After  the  children  had  been  drilled  for  three  weeks  by  one  of  the 
three  methods  described  above,  they  were  given  the  Thorndike-McCall 
Reading  Scale,  Form  II,  to  measure  any  improvement  which  had  been 
made  in  comprehension.  After  this  the  children  were  again  drilled  for 
three weeks, and on Friday of the sixth week Form III of the
Thorndike-McCall Scale was given to all classes.

The  testing  program  outlined  above  was  followed  by  all  schools 
with  the  exception  of  Indianapolis.  Here,  on  account  of  a  shortage  of 
Form  I  tests,  Form  III  was  given  first,  followed  by  Forms  II  and  I. 
Theoretically  this  should  make  no  difference  in  the  results  as  all  of  these 
scales  are  of  equal  difficulty  from  a  statistical  point  of  view. 

In  order  that  a  gain  in  comprehension  might  not  take  place  at  the 
expense  of  some  other  element  of  equal  importance,  the  teachers  were 
urged  to  have  all  reading  done  as  rapidly  as  possible,  provided  that  it 
was  not  encouraged  beyond  the  point  of  efficiency.  In  order  to  make 
sure  that  the  children  did  not  lose  in  speed,  500  children  from  the  schools 
of  Indianapolis  were  given  the  Courtis  Silent  Reading  Tests,  Forms  I 
and II. It was planned that Form I be given on the first day and that
Form II be given on the last day of the experiment, but owing to
circumstances over which we had no control only three weeks intervened
between the administering of the first and second tests.

IV.      DATA 

The data which I shall present today are a compilation of the returns
from only 35 teachers. As has already been explained, there were in all
75 teachers cooperating in this experiment, but on account of spring
vacations one school system started the experiment one week late,
and, as a result, the returns from this city were not sent in to the
office soon enough to be included in the data presented today. The
report today, therefore, will give data on only 1,129 pupils, distributed
among the various types as follows: 398 in Type I, 412 in Type II, and
319 in Type III. The classes originally were divided in such a manner
that there were an equal number in the drill and the non-drill sections.
The inequality shown in Table II is due to the fact that a child's score
was not considered unless he took both tests.

Table II. General Information

                Type I    Type II    Type III    Total
Teachers           13        13          9          35
Drill             203       210        163         576
Non-Drill         195       202        156         553
Total             398       412        319       1,129

As  has  been  previously  stated,  it  was  the  original  plan  to  report  the 
amount  of  improvement  made  after  three  weeks  of  drill  and  again  after 
six  weeks  of  drill.  These  data,  with  the  data  compiled  from  the  Courtis 
tests,  will  be  tabulated  within  the  next  few  weeks,  but  time  did  not 
permit the work to be completed for presentation at this
conference. The report today gives data on only two tests: Form I, which
was  given  before  the  drill  work  began,  and  Form  III,  which  was  given 
on  the  last  day  of  the  drill  period. 


Table III. Average Gain by Sections

                        Form III    Form I    Average Gain
Drill                    50.21      46.40         3.81
Non-Drill                47.75      46.29         1.46
Drill over Non-Drill                              2.35

Table III shows the average gain in T. scores for the drill and
non-drill groups. The average score for the non-drill group before training
was 46.29 while the average score after training was 47.75, or a gain of
1.46. The average score for the drill group before training was
46.40 while the average score after training was 50.21, or a gain of 3.81.
This table shows that the average grade child, under the teaching he is
receiving in these particular schools, gains normally during a period of
six weeks 1.46 points on the Thorndike-McCall Scale, while children of
approximately equal reading ability gain 3.81 points on the same scale,
after having been subjected to intensive drill exercises for thirty
minutes each day for a period of six weeks.
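
The arithmetic behind these gains can be retraced in a few lines. The sketch below uses only the four averages reported above; the variable names are illustrative, and the final figure is simply the difference between the two reported gains.

```python
# Average T scores (Form I before drill, Form III after), as reported above.
averages = {
    "drill": (46.40, 50.21),
    "non_drill": (46.29, 47.75),
}

# Gain for each section = average after minus average before.
gains = {section: round(after - before, 2)
         for section, (before, after) in averages.items()}

# Advantage of the drill section over the non-drill section.
advantage = round(gains["drill"] - gains["non_drill"], 2)

print(gains)
print("Drill over non-drill:", advantage)
```

On these figures the drill sections gained about two and a third T-score points more than the non-drill sections over the six weeks.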

While  data  of  the  type  presented  in  Table  III  are  gratifying  to  the 
one  directing  the  experiment,  they  are  of  little  value  to  one  interested  in 
supervision.  If  gains  are  made  the  supervisor  wants  to  know,  first,  which 
particular  teachers  are  responsible  for  the  gains,  and,  second,  which 
type of drill work proved most effective. Table IV gives this
information.

Table IV. Showing Sections Which Received Higher Score

Types          I.     II.    III.    Total
Drill           7      7      7       21
Non-Drill       6      6      2       14
Total          13     13      9       35

In Type I, 7 of the 13 classes profited by the drill work, while 6
failed to profit. In Type II, the results were similar to those in Type I,
but in Type III, while there are but 9 classes reported, it is important
to notice that 7 of the 9 sections profited by the drill work. Comparing
the drill sections with the non-drill sections in all 35 classes, 21 of the
drill sections profited by the drill work, while 14 failed to profit.

Since  the  important  element  in  any  teaching  is  the  individual  child, 
the  supervisor  is  always  interested  in  knowing  the  value  of  an  exercise 
not  only  in  the  language  of  central  tendencies,  but  also  in  terms  of  the 
individual  child. 

Table V. Number of Pupils Gaining

                 Type I    Type II    Type III    Total

Gain      A       151        170        111        432
          B       133        143        105        381

Loss      A        40         32         47        119
          B        46         52         43        141

Same      A        12          8          5         25
          B        16          7          8         31

Total     A       203        210        163        576
          B       195        202        156        553

Table V is read as follows: Of the 203 children who were drilled in
Type I exercise, 151 gained, 40 lost, and 12 received the same score in
both tests.



Table VI. Percentage of Pupils Gaining

                         Type I    Type II    Type III    Total

Drill                      74        80         68         75
Non-Drill                  68        71         67         69

Drill over Non-Drill        6         9          1          6

Table V reduced to percentages shows that of the 398 children using
drill exercise Type I, 74 per cent of the drill group profited, while but
68 per cent of the children in the non-drill group increased their scores
during the six weeks' period. Using the percentage of pupils gaining as
a method of determining the relative value of these three types of
exercises, Table VI shows Type II to be first with 80 per cent, Type I
second with 74 per cent, and Type III third with 68 per cent.
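
The reduction of counts to percentages for Type I may be sketched as follows. The helper `percent_gaining` is a hypothetical name, and rounding to the nearest whole per cent is an assumption, since the text does not state how the percentages were rounded. The counts are those read from Table V for the Type I drill and non-drill groups.

```python
# Sketch: reducing the Type I counts of Table V to percentages.
# `percent_gaining` is a hypothetical helper; nearest-whole rounding is assumed.

def percent_gaining(gained: int, total: int) -> int:
    """Whole-number percentage of pupils whose score increased."""
    return round(100 * gained / total)

drill_pct = percent_gaining(151, 203)       # Type I, drill group
non_drill_pct = percent_gaining(133, 195)   # Type I, non-drill group
pupils = 203 + 195                          # all pupils in the Type I comparison

print("Pupils compared:", pupils)
print("Drill gaining, per cent:", drill_pct)
print("Non-drill gaining, per cent:", non_drill_pct)
print("Drill over non-drill:", drill_pct - non_drill_pct)
```

The same reduction applied to the other columns of Table V would supply the remaining entries of Table VI, subject to whatever rounding rule was actually used.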

Data  showing  the  number  of  pupils  that  gain  as  a  result  of  any 
exercise are a more accurate measure of a teaching method than data
which  show  the  average  gain   for  each   pupil. 

Table VII. Average T. Score Gain Per Pupil

                         Type I    Type II    Type III

Drill                     3.13      5.10       2.99
Non-Drill                 3.11      3.80       2.78

Drill over Non-Drill       .02      1.30        .21

Even  by  the  method  of  average  T.  Score  gains,  Drill  Exercise  Type 
II would again receive first rank.

In  any  kind  of  drill  work,  the  supervisor  is  always  interested  in 
knowing  the  type  of  pupils  who  are  benefited  by  the  exercise.  A  drill 
exercise  which  is  of  value  to  the  class  as  a  whole  may  prove  a  detriment 
to  the  pupils  in  the  upper  quartile. 




Table VIII. Percentage of Gain by Quartiles

                              Q4             Q1

Type I.     Drill         [illegible]        80
            Non-Drill     [illegible]        84

Type II.    Drill         [illegible]    [illegible]
            Non-Drill         51             83

Type III.   Drill             50             82
            Non-Drill         50             75

Table  VIII  shows  that  while  the  exercises  used  in  this  experiment 
were  of  some  value  to  the  pupils  of  the  upper  quartile,  they  proved  to 
be  of  most  value  to  the  pupils  of  the  lower  quartile.  It  is  interesting 
to  note  that  from  the  standpoint  of  the  better  readers,  as  measured 
by  the  Thorndike-McCall  Scale,  the  exercises  are  of  value  in  the  order 
named,  while  from  the  standpoint  of  the  slower  pupil  Type  II  ranks 
first,  Type  III  ranks  second,  with  Type  I  ranking  third. 

V.  CONCLUSIONS 

While  one  hesitates  to  draw  any  conclusions  before  all  the  data  are 
tabulated,  the  facts  presented  this  afternoon  seem  to  warrant  certain 
tentative  conclusions. 

1.  Comprehension  ability  in  silent  reading  as  measured  by  the 
Thorndike-McCall  scale  may  be  improved  to  a  degree  equivalent  to 
nearly  one  semester  of  work  in  grades  four  to  eight  by  careful,  sys- 
tematic drill  work  covering  a  period  of  six  consecutive  weeks,  provided 
that  thirty  minutes  each  day  is  devoted  to  this  type  of  work. 

2.  A  teacher  who  is  interested  in  improving  comprehension  ability 
in  silent  reading  as  measured  by  the  Thorndike-McCall  Scale  would  be 
justified  in  using  any  one  of  the  exercises  used   in  this  experiment. 

3.  The relative value of the exercises used in this experiment varies
with  the  reading  ability  of  the  pupil.  Type  II,  or  the  organization 
exercise,  is  of  most  value  to  the  average  pupil,  and  also  to  the  pupils 
in  the  lower  quartile,  while  Type  I,  or  the  vocabulary  study,  is  of  most 
value  to  the  pupil  in  the  upper  quartile. 


VI.  AN  EVALUATION 

Anyone who has had any experience in the field of experimental
education is aware of the fact that the results of this experiment are
by no means startling. There are several facts, however, which one
should keep in mind while attempting to estimate its worth. This
experiment was performed in six of the best school systems of the state.
In these schools the subject of silent reading had been emphasized
thruout the entire school year. In order to make sure that the experiment
would not fall into the hands of inexperienced teachers, each
superintendent was requested to permit only those teachers to
participate who were especially interested in the teaching of silent
reading. Had this experiment been carefully supervised in schools in
which the subject of silent reading had not been stressed, greater gains
might have resulted.

Altho  the  data  showed  that  the  drill  work  in  organization  proved 
the  most  effective  means  of  improving  comprehension,  it  must  be  re- 
membered that  these  conclusions  were  made  after  comparing  the  scores 
made  by  the  drill  group  with  those  of  the  non-drill  group.  In  our  in- 
structions to  teachers  it  was  suggested  that  the  drill  group  recite  in 
a  separate  room.  The  teacher  as  a  rule  found  it  impossible  to  comply 
with  the  request.  With  this  thought  in  mind  it  is  interesting  to  note 
that  the  work  in  organization,  an  exercise  which  was  performed  largely 
as  an  individual  problem,  proved  to  be  of  most  value  to  the  average 
pupil. 

Again  while  exercises  one  and  three  proved  to  be  of  less  value  than 
did  exercise  two,  the  results  might  have  been  different  had  a  finer 
measuring  scale  been  used.  A  close  examination  of  the  steps  on  the 
Thorndike-McCall  Scale  reveals  the  fact  that  each  unit  on  the  scale 
is equal to an age difference of from three to six months. While the
six weeks of practice may have produced considerable gain, this gain
may not have been equal to six months of regular school
work.  This  study  indicates  that  we  need  further  experimentation  using 
finer  measuring  scales. 

BIBLIOGRAPHY 

1.  Germane, C. E. Outlining and Summarizing Compared with Re-reading
    the Same Article. Twentieth Year-Book, Part II, p. 109.

2.  Gray, C. T. Type of Reading Ability as Exhibited through Tests
    and Late Experiments. Sup. Ed. Mon., Vol. I, No. 5, Aug., 1915,
    University of Chicago.

3.  Gray, W. S. Eighteenth Year-Book of the N.S.S.E., Pt. II, p. 28.

4.  Gray, W. S. Twentieth Year-Book of the N.S.S.E., Pt. II, p. 49,
    Public School Publishing Co., Bloomington, Ill.

5.  Hilliard, George H. Probable Types of Difficulties Underlying Low
    Scores in Comprehension Tests. (An unpublished doctor's
    dissertation, University of Iowa), 1922.

6.  Horn, E. A Constructive Program in Silent Reading. Jr. of
    Ed. Research, May, 1921.

7.  O'Brien, John A. Silent Reading. The Macmillan Co., 1921.

8.  Peters, C. C. The Influence of Speed Drills upon the Rate and
    Effectiveness of Silent Reading. Jr. of Ed. Psy., Vol. VIII,
    pp. 350-66.

9.  Pressey, L. W., and Pressey, S. L. A Critical Study of the Concept
    of Silent Reading. Jr. of Ed. Psy., Vol. XII, pp. 25-31.

10. Thorndike, E. L. Educational Psychology. Vol. II, p. 151, Teachers
    College, Columbia University, New York City.

11. Thorndike, E. L. Reading as Reasoning. Jr. of Ed. Psy., Vol. VIII,
    p. 323.

12. Woody, C. W. Measuring of a New Phase of Reading. Unpublished
    manuscript.

13. Yoakam, G. A. The Effect of a Single Reading. Twentieth Year-Book
    of the N.S.S.E., Pt. II, p. 90.


How to Interpret and Make Use of Mental Tests


Herman H. Young, Associate Professor of Clinical Psychology, Indiana
University


A  MENTAL  test  is  merely  a  specially  constructed  arrangement  of 
mental  stunts  to  be  uniformly  presented  to  all  persons  selected  for  test- 
ing. In  psychological  phraseology  a  mental  test  is  a  stimulus  or  series 
of  stimuli  presented  to  an  individual  or  group  of  individuals  to  elicit 
a  verbal  response  or  motor  reaction  under  such  conditions  that  a  perma- 
nent record  thereof  is  made,  either  by  the  individual  who  is  being  tested 
or  by  the  one  doing  the  testing.  Since  we  are  now  interested  primarily 
in  group  mental  tests  it  should  be  recognized  that  a  group  mental  test 
is  a  series  of  stimuli  presented  by  a  predetermined  and  uniform  method 
to  groups  of  individuals  to  elicit  a  written  response  from  each  member 
of  the  group.  Most  group  mental  tests  are  constructed  so  that  the  time 
and energy required in writing the response is very small. An
individual's score is largely the result of his mental reaction to the test and
not  the  result  of  time  and  energy  consumed  in  the  physical  manipula- 
tion of  the  pencil  in  writing  the  response. 

Vitiating,  unstandardized  conditions  which  vary  from  individual 
to  individual  and  from  school  to  school  and  which,  when  present,  seri- 
ously modify  the  scores  of  the  individuals  or  group  affected,  may  be 
classified  under  three  headings:  (1)  physical  conditions  of  the  person 
being  tested,  (2)  failure  on  the  part  of  the  tester  to  present  the  tests 
in  accordance  with  the  standardized  instructions,  and  (3)  particular 
nature  of  the  physical  surroundings  under  which  the  test  is  presented. 

Physical  conditions  which  may  affect  the  score  of  a  person  are  of 
two  types,  temporary  or  permanent,  and  include  such  conditions  as  de- 
fective vision,  defective  hearing,  physical  deformities,  fatigue,  malnu- 
trition, lack  of  sleep,  headache,  and  illness  during  the  test. 

Faulty presentation of the tests includes such things as the
following: The tester does not talk loud enough, he talks too rapidly, he
alters  instructions  in  accordance  with  his  own  judgment,  he  gives  extra 
suggestions,  his  manner  of  conduct  is  such  that  the  children  look  upon 
him  as  a  curiosity,  he  rushes  nervously  about  and  gets  everybody's 
nerves  on  edge,  he  forgets  part  of  the  instructions,  he  disregards  time 
limits,  he  unnecessarily  scolds  a  child  and  is  bossy. 

Physical  surroundings  during  the  test  include  such  things  as  break- 
ing pencils,  radiators  discharging  steam,  honking  of  automobile  horns, 
the  ringing  of  bells  and  whistling  of  passing  trains  while  verbal  in- 
structions are  being  given,  poor  light,  testing  thru  recess  period  or  after 
closing time, or while other children are at recess, principal or teacher
boisterously  and  possibly  unnecessarily  scolding  a  child,  some  curious 
individual  making  a  nuisance  of  himself  by  passing  up  and  down  the 
aisle  while  the  children  are  at  work,  and  lack  of  sufficient  desk  space  to 
write  comfortably. 

Besides  these  special  conditions  which  affect  the  scores  of  only  that 
particular  individual  or  that  particular  group  which  happens  to  be  the 
victim  of  such  unfortunate  circumstances,  there  are  considerations  and 
features  of  test  construction  and  interpretation  which  affect  all  persons 
tested.  Anything  at  all  can  be  presented  as  a  mental  test,  but  not  all 
things  that  might  be  whipped  into  test  form  belong  in  so-called  mental 
tests.  The  fundamental  criteria  can  be  stated  in  some  such  a  manner 
as  this:  A  mental  test  should  present  an  extensive  variety  of  stimuli 
of  such  nature  that  the  examinee  has  never  before  encountered  them  in 
that  form.  Ideally,  the  variety  of  stimuli  should  be  extensive  enough 
to  make  demands  upon  every  type  and  phase  of  mental  activity,  in 
order,  on  the  one  hand,  to  allow  the  examinee  to  display  his  mentally 
strong  points,  and,  on  the  other  hand,  to  cause  him  to  reveal  his  weak 
points.  As  the  presenting  of  stimuli  in  the  same  form  in  which  they 
have  been  experienced  previously  emphasizes  memory,  it  is  neces- 
sary to  present  them  in  a  manner  new  to  the  individual, 
else  only  memory,  that  one  phase  of  mental  activity,  will  be  tested.  As 
the  presenting  of  entirely  new  stimuli,  no  part  of  which  has  ever  been 
experienced  previously,  calls  forth  nothing  but  a  guess  or  chance  reac- 
tion, if  any  at  all,  it  is  necessary  to  present  stimuli  that  are  new  in  no 
respect  except  the  particular  setting  in  which  they  are  presented  in  the 
mental  test,  i.e.  they  must  tap  old  experiences  from  a  new  angle. 

It  is  impossible  to  devise  any  group  test  which  will  meet  these 
ideal  demands,  because  no  two  people  ever  have  identically  the  same 
environment  and  experiences.  This  does  not,  however,  preclude  the 
possibility  of  constructing  a  series  of  tests  which  for  all  practical  pur- 
poses are  equally  fair  to  the  different  members  of  selected  groups.  On 
account  of  special  environmental  settings  one  person  will  be  enabled 
to  answer  certain  questions  from  memory,  and  will  have  little  or  no 
experience  to  assist  him  in  answering  certain  other  questions.  Varying 
from  individual  to  individual,  never  two  alike,  different  memory  suc- 
cesses will  be  made  possible  and  different  failures  inevitable  according 
to  the  peculiar  environmental  and  educational  background  which  each 
happens  to  have  had.  Thus  each  individual  will  increase  his  score  by 
passing  certain  tests  largely  from  memory  while  others  will  fail  on 
these  same  tests  just  because  their  environment  did  not  give  them  the 
necessary  experiences.  Stated  inversely,  each  will  fail  to  score  on 
certain tests because his environment has not given him the necessary
experiences, while others will pass these same tests from memory.

With  so  many  variable  factors  actively  involved  in  producing  mental 
test  scores  our  interpretations  and  use  of  them  must  be  made  cautiously 
and  only  in  full  recognition  of  our  limitation  in  so  doing. 

Most  of  the  conditions  already  enumerated  as  capable  of  seriously 
affecting  an  individual's  score  can  do  so  only  by  preventing  it  from 
being as large as it would be under normal conditions. It is obvious
that  an  individual  may  fall  very  low  on  a  test  because  of  some  tem- 
porary physical  condition,  because  of  some  fault  of  the  examiner,  or  be- 
cause of  the  surroundings  under  which  he  is  required  to  take  the  test, 
or  because  of  all  of  these.  Even  tho  these  are  only  occasional  causes 
of  failure,  they  may  have  serious  bearings  in  those  particular  cases  who 
unfortunately  are  the  victims  of  these  special  conditions,  and  they  must 
ever  be  kept  in  mind  when  passing  judgment  upon  those  making  low 
scores.  Because  these  vitiating  influences  may  strike  we  know  not  where, 
and  obviously  not  according  to  any  rule,  it  is  impossible  to  formulate 
any  general  method  of  handling  them  except  to  be  on  the  alert  for  their 
presence  and  to  make  due  allowances  for  them  when  found. 

Suppose  there  were  no  such  disturbing  influences  as  those  mentioned 
above,  that  every  person  were  physically  perfect,  that  the  tests  were 
presented  exactly  as  they  should  be,  and  that  the  testing  conditions  were 
ideal, how should we interpret and use such test results? Before
answering this question it is necessary to consider the various factors at
work in producing test scores. Since it is impossible to devise a mental
test  specifically  applicable  to  more  than  one  person,  the  fundamental 
criteria  underlying  the  construction  of  group  mental  tests  must  be  reck- 
oned with.  The  most  important  criterion  is:  that  the  test  should  pre- 
sent, in  a  manner  new  to  every  member  of  the  group,  a  wide  range  of 
different  types  of  stimuli  which  have  come  within  the  realm  of  their 
previous  experience. 

We  have  already  noted  why  a  wide  range  of  stimuli  is  necessary 
and  why  they  must  be  based  upon  past  experiences.  Our  next  task  is 
to  attempt  to  determine  the  significance  of  scores  resulting  from  such 
test  construction.  This  can  be  done  only  thru  use  of  carefully  defined 
terms. 

The  term  "score"  has  a  definite  meaning  and  is  universally  under- 
stood as  the  number  of  credit  points  which  an  individual  receives  for  his 
answers.  The  measures  of  central  tendency,  average,  median  and  mode, 
and  those  of  variability,  minimum  and  maximum,  mean  deviation,  quar- 
tile  deviation,  and  standard  deviation,  and  the  measures  of  relation  or 
correlation  are  universally  standardized  in  the  statistical  treatment  and 
interpretation of scores.¹
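
The measures named above may be illustrated with Python's standard `statistics` module. This is a minimal sketch and not a procedure taken from the bulletin: the ten scores are hypothetical, the population form of the standard deviation is an assumption, and the quartile deviation is taken here as half the distance between the first and third quartiles as computed by `statistics.quantiles` with its default method.

```python
import statistics

# Hypothetical scores for ten pupils; these are not data from the bulletin.
scores = [38, 42, 42, 45, 47, 50, 50, 50, 53, 63]

# Measures of central tendency.
mean = statistics.mean(scores)
median = statistics.median(scores)
mode = statistics.mode(scores)

# Measures of variability.
lowest, highest = min(scores), max(scores)
mean_deviation = sum(abs(s - mean) for s in scores) / len(scores)
q1, _, q3 = statistics.quantiles(scores, n=4)
quartile_deviation = (q3 - q1) / 2
standard_deviation = statistics.pstdev(scores)  # population form assumed

print("Average:", mean, "Median:", median, "Mode:", mode)
print("Minimum:", lowest, "Maximum:", highest)
print("Mean deviation:", mean_deviation)
print("Quartile deviation:", quartile_deviation)
print("Standard deviation:", round(standard_deviation, 2))
```

Different textbooks locate the quartiles in slightly different ways, so the quartile deviation in particular may differ a little from a hand computation.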

The  terms  "mental  ability"  and  "intelligence"  are  used  in  so  many 
different  ways  that  they  have  confused  rather  than  clarified,  have  hin- 
dered rather  than  facilitated  the  proper  interpretation  and  use  of  mental 
tests.  This  has  occurred  because,  purporting  to  be  basic  and  elemental 
concepts,  their  use  has  discouraged  and  actually  delayed  further  analy- 
sis. This  unanalyzed  usage  has  permitted  and  encouraged  the  belief 
that  test  scores  are  as  direct  and  reliable  a  measure  of  one's  mental 
ability  and  intelligence  as  pounds  are  of  one's  weight.  This  might  be 
true  if  by  mental  ability  and  intelligence  is  meant  the  individual's  ability 
to  do  things  at  the  very  time  he  is  taking  the  test,  just  as  by  weight 
we  mean  how  much  he  weighs  at  the  time  he  is  on  the  scales.  But  it 
can hardly be true if by mental ability and intelligence we mean the
individual's innate capacity for mental development, unless by weight
is meant an individual's innate capacity for physical development.

¹ Methods of determining and interpreting these statistical measures are
given in detail in Harold O. Rugg, Statistical Methods Applied to
Education.

Only  thru  the  use  of  unambiguous  terms  will  clarity  be  assured 
and progress be made in analyzing test results. For this reason the
two  terms,  "mental  ability"  and  "intelligence",  have  no  constructive 
value.  Other  terms  with  definite  and  entirely  distinct  meanings  are 
necessary  to  permit  and  force  a  clear  distinction  between  the  things  an 
individual  does  and  the  significance  of  those  things.  Whenever  an  in- 
dividual does  anything  at  all,  be  it  the  winking  of  an  eye,  the  signing  of 
an  international  treaty,  or  the  taking  of  a  mental  test,  he  is  performing 
an  act  or  series  of  acts.  The  score  of  an  individual  on  a  group  test  is 
only  a  numerical  statement  of  the  credit  value  of  his  performance,  i.e. 
of  what  he  actually  did  on  that  particular  test.  As  an  end  in  them- 
selves this  performance  and  its  score  have  no  value.  Properly  handled, 
they  may  serve  as  a  basis  for  predicting  a  child's  probable  development 
in  the  future.  The  only  way  in  which  future  developments  may  be 
predicted  is  in  terms  of  past  accomplishments  and  present  performances. 

It  would  therefore  seem  advisable  to  employ  the  term  "performance" 
as  Witmer  defines  it,  as  the  doing  of  anything  whatever,  be  it  very 
simple  or  very  complex.  The  taking  of  a  mental  test  is  a  performance 
which  is  evaluated  in  terms  of  credit  points,  called  scores.  It  is  not 
possible  in  terms  of  score  alone  to  predict  the  probable  future  standing 
of  an  individual.  This  situation  demands  the  introduction  of  a  term 
which  clearly  distinguishes  a  performance  from  the  significance  of  that 
performance,  which  distinguishes  an  observed  or  determined  fact  from 
the  interpretation  thereof,  and  which  definitely  emphasizes  the  differ- 
ence between  the  thing  accomplished  and  the  predictive  value  of  that  ac- 
complishment. It  is  undoubtedly  the  failure  to  make  this  distinction 
that  has  caused  more  mistakes  and  more  criticisms  in  mental  testing 
than  any  other  one  thing. 

Tests  should  be  employed  to  serve  two  purposes;  first  to  determine 
what  an  individual  actually  does  at  the  time  he  is  tested,  just  as  scales 
are  used  to  determine  what  a  person  actually  weighs  at  the  time  he  is 
on  the  scales;  and,  second,  as  a  point  of  departure  in  predicting  what 
may  be  expected  of  him  in  the  future.  Under  favorable  conditions  an 
individual  is  likely  to  continue  in  his  mental  and  physical  development 
according  to  certain  fairly  well-established  principles.  It  should  there- 
fore be  possible  to  so  interpret  an  individual's  present  performance  that 
prediction  could  be  made  with  a  fair  degree  of  reliability  as  to  the 
probable nature and extent of his future performances. The term
"developability"  is  employed  here  to  designate  the  probable  extensive- 
ness  and  complexity  of  the  highest  performance  level  or  upper  limits  of 
development  for  which  an  individual  furnishes  prospects. 

The  term  "performance"  applies  only  to  what  the  individual  actually 
does.  The  term  "developability"  is  used  to  designate  the  probable  high- 
est level  of  performances  the  individual  will  ever  execute  under  the 
most  favorable  circumstances  for  development. 

These definitely defined terms enable a direct attack to be made upon
the problem of interpreting mental test results. There are three
possibilities: first, that the score of an individual indicates the extensiveness
of  his  environmental  opportunities;  second,  that  it  indicates  his  develop- 
ability;  and  third,  that  it  only  indicates  his  level  of  performance  at  the 
time  he  takes  the  test.  If  the  first  assumption  were  correct,  the  in- 
dividual with  the  best  environmental  opportunities  would  have  the  widest 
range of helpful experiences and would make the highest score, the
one  with  the  next  best  environmental  opportunities  would  have  the  next 
widest  range  of  helpful  experiences  and  would  make  the  next  highest 
score,  and  so  on  for  all  the  individuals  down  to  the  one  having  the 
poorest  environmental  opportunities.  He  would  have  the  narrowest 
range  of  helpful  experiences  and  would  make  the  poorest  score.  If  this 
assumption  were  correct,  scores  would  be  a  measure  of  an  individual's 
past  environmental  opportunities. 

If  the  second  assumption  were  correct,  the  individual  with  the 
highest  developability  would  make  the  highest  score,  the  one  nearest 
him  in  this  respect  would  make  the  next  highest  score,  and  so  on  down 
to  the  one  with  the  lowest  developability.  He  would  make  the  lowest 
score. If this assumption were correct, scores would be a measure of
the complexity and extensiveness of an individual's developability.

If  the  third  assumption  were  correct,  the  individual  with  the  highest 
developability  would  probably  make  the  highest  score  only  in  case  he 
had had the advantages of more helpful environmental influences than
any  other  individual.  It  is  also  probable  that  the  individual  with  the 
largest  number  of  helpful  environmental  influences  might  make  a  better 
score  than  others  of  superior  developability  but  not  favored  with  such 
a large number of helpful environmental influences. If this assumption
were  correct,  scores  could  only  be  a  measure  of  what  an  individual  act- 
ually does  on  the  test. 

When  viewed  in  this  perspective  there  is  no  doubt  as  to  which  of 
these  three  possibilities  corresponds  closest  to  facts.  It  is  obviously 
preposterous  to  assume  that  mental  tests  measure  the  extensiveness  of 
an  individual's  past  environmental  influences,  and  it  is  just  as  prepos- 
terous to  assume  that  they  measure  his  developability.  The  indispensa- 
ble value  of  experience  has  already  been  indicated.  Without  it  human 
beings  would  be  as  incapable  of  responding  to  mental  tests  as  inanimate 
objects  are.  In  fact  that  is  the  case  when  they  are  given  tests  no  part 
of  which  has  ever  come  within  the  realm  of  their  experience. 

Two,  and  only  two,  measurable  things  determine  what  an  individ- 
ual will  do  on  a  mental  test.  The  first  is  the  nature  and  extent  of  his 
past  environmental  influences;  the  second  is  the  effect  produced  upon  or 
within  him  by  these  environmental  influences.  It  is  reasonable  to  expect 
that  the  effect  which  an  environment  can  have  on  a  person  will  vary 
with  the  length  of  time  it  has  had  opportunity  to  act  upon  him. 

The  most  probable  effect  which  environment  will  have  on  him  in  the 
future  can  be  reliably  predicted  only  in  terms  of  the  effect  which  his 
past  environment  has  had  upon  him.  Prediction  can  therefore  be  made 
only in terms of the relative effectiveness of past environmental
influences upon the individual. The general nature and extent of the
environmental influences to which an individual has been subjected can be
satisfactorily  determined  without  much  difficulty.  There  are  two  entirely 
separate  and  distinct  phases  of  environmental  influences  which  must 
always  be  considered.  They  are  the  particular  nature  or  type  of  en- 
vironment to  which  the  individual  has  been  exposed  and  the  extent  or 
length  of  time  of  the  exposure.  Therefore  the  particular  effect  which 
an  environment  can  be  expected  to  have  upon  an  individual  will  depend 
upon  its  specific  nature  and  the  length  of  time  he  has  been  subjected 
to  it. 

The  important  facts  of  an  individual's  past  life  are  the  nature  of  the 
environment  or  environments  to  which  he  has  been  exposed  and  the 
length  of  time  he  has  been  exposed  to  each.  It  would  seem  reasonable 
to  expect  that  the  effectiveness  of  a  given  environment  upon  a  given 
individual  would  vary  directly  with  the  length  of  time  he  has  been  ex- 
posed to  it.  This  would  seem  to  be  especially  applicable  with  children 
reared  in  the  same  community,  because  so  far  as  environmental  influ- 
ences and  experiences  are  concerned  all  children  start  from  the  same 
point,  or  zero.  Each  year  as  the  length  of  environmental  exposure 
increases  they  do  more  things  and  things  more  complex.  Thru  use  of 
scientific methods of investigation it is possible to determine about how
many  and  what  kinds  of  things  children  of  each  age  do.  In  this  way, 
and  in  this  way  only,  can  the  standard  of  accomplishment  for  each  age 
be  determined.  Mental  tests  are  a  modern  device  for  determining  what 
children  do  under  specified  conditions.  It  is  extremely  important  to  note 
that  tests  do  only  this  and  nothing  more.  After  a  large  number  of  chil- 
dren have  been  tested  on  a  particular  test  it  is  possible  to  treat  the  scores 
statistically  and  find  the  average  number  of  credit  points  which  children 
of  each  age  earn.  This  average  is  then  an  established  standard  of  what 
children  of  each  of  the  various  ages  actually  do  under  these  standard- 
ized conditions,  and  may  be  taken  as  the  average  performance  level  for 
the  respective  ages.  Thus  the  average  score  for  six-year-old  children 
establishes  a  standard  of  performance.  Because  it  is  the  average  of  the 
performance  of  six-year-old  children  it  constitutes  the  six-year-old  per- 
formance level.  In  the  same  way  a  performance  level  may  be  established 
for  each  age.  Whenever  a  child  earns  the  appropriate  number  of  credit 
points  on  a  test  to  give  him  the  score  that  is  the  average  for  any  given 
age,  he  may  be  referred  to  as  having  the  performance  level  of  that 
particular  age.  Thus  to  say  that  a  child  has  a  six-year-performance 
level  means  one  thing  and  only  one  thing,  i.e.  that  he  performed  the 
test  in  such  a  manner  that  he  received  the  same  number  of  credit  points 
as  the  average  for  six-year-old  children.  The  difference  between  say- 
ing this  and  saying  he  has  a  mental  age  of  six,  implying  thereby  that 
he  is  like  a  six-year-old  child,  is  greater  than  the  difference  between 
day  and  night.  Only  six-year-old  children  can  be  like  six-year-old 
children.  This  does  not  mean  any  more  about  the  resemblance  of  the 
children  in  other  respects  than  to  say  two  men  are  alike  because  both 
drive Fords. If nothing else besides a child's score is known, prediction
as  to  his  future  is  more  impossible  and  ridiculous  than  prediction  as  to 
the  future  of  the  Ford  drivers. 
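
The procedure described above, averaging the scores earned at each age and then naming a child's performance level by the age whose average he matches, may be sketched in Python. Everything in the block is illustrative: the records are hypothetical, the names `age_norms` and `performance_level` are assumptions of this sketch, and a score falling between two age averages is assigned to the nearer one, a convention the text itself does not specify.

```python
from statistics import mean

# Hypothetical (age, score) records; these are not data from the bulletin.
records = [(6, 18), (6, 22), (6, 20),
           (7, 27), (7, 25), (7, 29),
           (8, 33), (8, 35), (8, 37)]

def age_norms(rows):
    """Average number of credit points earned by children of each age."""
    by_age = {}
    for age, score in rows:
        by_age.setdefault(age, []).append(score)
    return {age: mean(scores) for age, scores in sorted(by_age.items())}

def performance_level(score, norms):
    """Age whose average score lies nearest the given score (assumed rule)."""
    return min(norms, key=lambda age: abs(norms[age] - score))

norms = age_norms(records)
level = performance_level(26, norms)

print("Age norms:", norms)
print("A score of 26 corresponds to the performance level of age", level)
```

The result says only that the score resembles the average for that age on this test. It carries no statement about the child's resemblance to children of that age in any other respect, which is the point the paragraph above insists upon.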

An uninterpreted mental test score is useless. A misinterpreted
mental  test  score  is  vicious.  Reliable  interpretation  and  use  of  test 
scores can be made only by giving due weight to the known and
measurable factors involved in their production. The suggested
interpretation already given may be summarized thus: Altho score is only a
number  indicating  the  credit  value  of  an  individual's  performance,  it 
results  from  the  effectiveness  of  the  individual's  past  environmental 
influences.  That  the  effect  which  an  environment  can  be  expected  to 
have  varies  with  the  nature  of  the  environment  and  the  length  of  ex- 
posure to  it  seems  to  be  borne  out  by  the  immense  amount  of  data 
already  collected  on  the  thousands  of  children  tested  by  various  mental 
tests.  Without  exception  the  average  score  for  each  age  increases  with 
increasing  age.  Here  age  is  the  measure  of  the  exposure  time.  This 
does  not  hold  for  adults,  because  with  adulthood  comes  specialization 
which  means  the  restricting  of  environmental  influences  to  a  specific 
type.  Increase  could  not,  of  course,  continue  indefinitely  because  there 
are degrees of perfection which cannot be exceeded.

Knowledge  of  a  child's  present  and  past  life  is  of  value  as  an  aid 
in  discovering  which  way  he  is  headed,  how  fast  he  is  traveling  in  that 
direction,  and  what  his  probable  highest  performance  level  will  be  event- 
ually, providing  he  is  stimulated  by  favorable  environment.  His  direc- 
tion, rate  of  travel,  and  probable  limit  can  only  be  estimated  in  terms 
of  his  present  performances  taken  as  an  index  of  the  effectiveness  of 
his past environmental influences. Barring abnormal circumstances, it
would  seem  reasonable  to  assume  that  the  rank  order  arrangement  of 
performance  levels  of  children  of  the  same  age  would  remain  practically 
constant  thruout  life.  Thus  if  in  a  given  community  the  performance 
levels  were  determined  for  all  children  on  their  sixth  birthday  and  they 
were  listed  in  rank  order,  we  would  expect  the  rank  order  of  the  new 
performance  levels  of  these  children  to  be  practically  the  same  on  each 
succeeding  birthday.  For  convenience  the  performances  of  these  children 
could  be  thrown  into  100  different  rank  order  groups.  In  the  first  or 
lowest  group  could  be  put  that  1  per  cent  of  the  children  who  earned 
the  fewest  credit  points,  in  the  second  group  could  be  put  the  next  lowest 
1  per  cent  of  the  children,  and  in  this  way  the  groups  could  be  con- 
structed, each  containing  1  per  cent  of  the  total  group,  until  all  were 
included.  As  each  of  these  one  hundred  groups  contains  1  per  cent  of 
the  entire  group,  each  may  be  referred  to  as  a  percentile.  The  par- 
ticular position  which  each  occupies  may  be  indicated  by  numbering 
them  in  order  from  the  bottom  or  lowest  group  up.  The  first  percentile 
contains  the  children  whose  performance  levels  are  so  low  that  99 
per  cent  of  all  children  the  same  age  do  better.  The  second  percentile 
contains  the  children  whose  performance  levels  occupy  such  a  position 
in  the  rank  order  arrangement  that  1  per  cent  do  worse  and  98  per 
cent  do  better.  In  this  way  the  number  of  each  percentile  group  indi- 
cates exactly  what  per  cent  of  children  do  worse  and  what  per  cent  do 
better. 
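
The construction of the hundred rank order groups can be sketched as follows. The scores are invented for illustration, the function name is hypothetical, and the handling of tied scores (by position in the sorted list) is an assumption; with fewer than 100 children some groups would simply be empty.

```python
def percentile_groups(scores):
    """Assign each score to one of 100 rank-order groups (1 = lowest).

    Tied scores are separated by their position in the sorted list,
    which is an assumption of this sketch.
    """
    n = len(scores)
    order = sorted(range(n), key=lambda i: scores[i])
    groups = [0] * n
    for rank, i in enumerate(order):
        groups[i] = rank * 100 // n + 1
    return groups


# Hypothetical scores for 200 children of the same age (all different).
scores = [(7 * k) % 200 for k in range(200)]
groups = percentile_groups(scores)

# Each of the 100 groups holds 1 per cent of the children, here 2 each.
print(groups.count(1), groups.count(100))
```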

One  value  of  this  grouping  is  that  it  locates  each  child  with  respect 
to  the  entire  group  and  permits  a  direct  comparison  to  be  made  between 
any two children within the group. An individual's percentile is not
only a convenient way of presenting facts, but a precise statement of
the  exact  location  of  his  work  compared  with  that  of  others  exposed 
to  the  same  environment  for  the  same  length  of  time.  According  to  the 
theory  of  probability  it  is  not  likely  that  the  increased  exposure  to 
environmental  influences  that  is  bound  to  come  with  increasing  age  will 
seriously alter the percentile rank order arrangement of the children.
It  should  be  expected  that  the  performance  level  of  all  the  children 
would  be  raised  with  increasing  age,  but  that  the  rank  order  arrange- 
ment of  these  levels,  as  expressed  in  percentiles,  would  remain  prac- 
tically the  same.  Important  changes  in  the  environmental  setting  of 
a  child  or  a  group  of  children  would  undoubtedly  produce  a  change  in 
their  performances  and  cause  their  rank  orders  to  be  shifted  accord- 
ingly. The  great  variety  and  wide  range  of  performances  coming  be- 
tween the  highest  and  lowest  performance  levels  of  children  of  each 
age  produces  considerable  overlapping  of  the  performance  levels  of  chil- 
dren of  different  ages.  This  occurs  because  a  given  performance  level 
is  attained  by  some  children  in  the  course  of,  say,  six  years,  while  others 
do  not  attain  this  performance  level  until  they  have  been  exposed  to  the 
environment  for  seven  years.  There  are  others  who  need  eight  years  of 
exposure.  Certain  individuals  earn  this  performance  level  only  after 
many  more  years,  and  some  never  earn  it. 

If  the  percentile  ranks  of  children  remain  practically  unchanged 
with  increasing  age,  then  the  performance  level  of  each  child  must 
increase  by  a  relatively  constant  ratio  varying  directly  with  the  in- 
creased length  of  time  he  is  exposed  to  the  environment.  This  then  is 
the  principle  of  human  development  which  should  serve  as  a  guide  in 
interpreting  and  using  test  results.  If  children  of  different  ages  make  the 
same  score  on  a  test,  this  demonstrates  them  to  have  the  same  per- 
formance level  on  that  test  at  the  time  the  test  is  given,  but  of  itself 
tells  nothing  about  their  probable  future  performance  level. 

Scientific  classification  of  children  requires  that  each  group  be 
standardized  with  respect  to  three  criteria:  first,  that  they  all  have  the 
same  performance  level  at  the  start;  second,  that  they  all  be  exposed  to 
the  same  environmental,  in  this  case  educational,  influences;  and  third, 
that  the  performance  levels  of  all  increase  at  the  same  rate  so  that  their 
performance  levels  remain  equal  thruout  and  all  finish  at  the  same  time 
with  the  same  performance  level.  Mental  and  educational  tests  are  the 
most  reliable  means  available  for  determining  a  child's  present  per- 
formance level.  Prediction  for  the  rate  of  increase  of  his  future  per- 
formance level  can  be  made  only  on  the  basis  of  measurable  factors  which 
have  produced  his  present  performance  level.  These  measurable  factors 
are  the  nature  of  his  past  environment  and  the  length  of  time  he  has 
been   exposed  to   it,   usually    indicated  by  his   age. 

A  suddenly  accelerated  or  retarded  development  is  hardly  probable. 
The  development  most  likely  to  occur  is  merely  a  continuation  of  that 
which  has  characterized  an  individual's  past,  both  as  to  its  general  na- 
ture, and  as  to  its  relative  rate  of  increase  with  increasing  age.  It 
should  not  therefore  be  expected  that  two  children  of  different  ages 
having the same performance level would continue to have the same
performance levels, or would ever again have the same performance levels,
any  more  than  that  it  should  be  expected  that  two  trains,  one  a  freight 
and  the  other  an  express,  going  the  same  direction  on  parallel  tracks 
would  continue  to  be  equidistant  from  the  starting-point,  or  would 
ever  again  be  the  same  distance  from  the  starting-point  before  they 
reach  their  destination.  At  just  what  distance  from  the  starting-point 
the  express  will  pass  the  freight  depends  upon  the  rate  of  travel  for 
both  and  how  many  hours  the  freight  left  the  starting-point  ahead  of 
the  express.  If  the  express  left  first  they  never  would  be  together. 
In  this  way  each  child  may  be  thought  of  as  either  an  express  or  a 
freight,  depending  upon  the  rate  of  progress  of  the  other  individual  or 
individuals  of  the  group  with  which  he  is  being  compared.  Each  child 
in  the  course  of  his  life  will  have  the  experience  of  being  both  a  freight 
and  an  express.  There  are  enough  children  in  the  world  that  at  every 
stage  of  each  child's  development  his  performance  level  is  reached  by 
children  who  left  the  starting-point  after  he  did,  some  of  them  a  long  time 
after,  and  there  are  enough  children  traveling  slower  than  he  that  at 
every  stage  of  his  development  his  performance  level  will  reach  that  of 
children  born  before  he  was,  some  of  them  a  long  time  before.  It  is 
therefore exceedingly important to recognize the fact that those children
who make the same score and consequently have the same performance
level on a test were caught just at the time they happened to
be  even,  because  just  at  that  time  they  were  passing  one  another  in  the 
various  stages  of  their  development.  How  fast  they  are  traveling  or 
what  their  rate  of  development  is  can  be  determined  only  by  reference 
to  the  length  of  time  consumed  in  arriving  at  this  particular  performance 
level. 
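
The illustration of the two trains can be worked out numerically. In the sketch below the speeds and the head start are invented figures and the function name is hypothetical; it merely solves for the one point at which the express, leaving later, draws level with the freight.

```python
def passing_distance(freight_speed, express_speed, head_start_hours):
    """Distance from the starting-point at which the express passes the freight.

    The freight leaves `head_start_hours` before the express. Returns None
    when the express is no faster than the freight and so never passes it.
    """
    if express_speed <= freight_speed:
        return None
    # Hours after the express leaves at which both are equally far along.
    hours = freight_speed * head_start_hours / (express_speed - freight_speed)
    return express_speed * hours


# Hypothetical figures: freight at 20 miles an hour with a 3-hour start,
# express at 40 miles an hour.
print(passing_distance(20, 40, 3))
```

Before and after that single point the two trains are at different distances, which is the sense in which two children of different ages making the same score are caught while passing one another.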

Those  children  of  this  group  who  are  of  the  same  age  are  more  likely 
to  be  traveling  at  the  same  rate  and  to  have  the  same  rate  of  development 
than  any  others.  This  gives  a  basis  for  selecting  children  to  be  put  to- 
gether in  classes.  They  should  be  the  same  age  and  should  have  the 
same  performance  level.  The  performance  level  indicates  the  point  at 
which  to  begin  work  in  setting  the  educational  environment  for  future 
development.  The  rate  at  which  sections  may  be  expected  to  develop 
within  this  environment  varies  inversely  with  the  age  of  the  children 
having the same performance level.

That  this  is  exactly  what  happens  is  clearly  demonstrated  in  the 
age-grade-score  distribution  of  Table  I.  This  table  is  compiled  from 
data  collected  by  the  speaker  in  a  survey  of  the  city  schools  of  Youngs- 
town,  Ohio.  The  National  Intelligence  Tests  were  used  in  testing  all 
children  from  low  third  thru  the  eighth  grade  of  twenty-five  buildings. 
Age  eight  includes  all  children  who  had  reached  their  eighth  birthday, 
but  who  had  not  yet  reached  their  ninth  birthday  on  the  day  of  being 
tested.  All  other  age  groups  include  children  whose  ages  are  similarly 
determined. 


Table I. National Intelligence Tests, Scale A: Average Score for each
age-grade group having at least 20 children (8,167 children)

Age        3B   3A   4B   4A   5B   5A   6B   6A   7B   7A   8B   8A   Average
7          37    -    -    -    -    -    -    -    -    -    -    -        40
8          36  [?]   65    -    -    -    -    -    -    -    -    -        44
9          30   46   62   79   98    -    -    -    -    -    -    -        55
10         27   41   56   72   87   99  104    -    -    -    -    -        66
11         25   38   51   67   81   93  105  115  130    -    -    -        83
12          -   34   46   59   76   89   95  107  118  132  142    -        94
13          -    -   44   57   69   80   93  104  112  126  136  142       106
14          -    -    -    -   65   77   87   99  105  111  127  137       111
15          -    -    -    -    -    -   86   87  101  112  122  127       109
16          -    -    -    -    -    -    -    -    -    -    -  [?]       114
17          -    -    -    -    -    -    -    -    -    -    -    -       114
Average    31   44   53   69   80   89   97  105  111  [?]  123  130

Table II. National Intelligence Tests, Scale A: Percentage value of
scores for each age-grade group compared to age average

Age        3B   3A   4B   4A   5B   5A   6B   6A   7B   7A   8B   8A
7          93    -    -    -    -    -    -    -    -    -    -    -
8          80  111  150    -    -    -    -    -    -    -    -    -
9          55   84  113  144  178    -    -    -    -    -    -    -
10         41   62   85  109  132  150  158    -    -    -    -    -
11         30   46   61   81   98  112  120  139  157    -    -    -
12          -   38   49   62   81   95  101  114  126  140  151    -
13          -    -   42   54   65   70   88   98  106  [?]  130  134
14          -    -    -    -   59   70   80  [?]  106  100  114  123
15          -    -    -    -    -    -  [?]   80   93  103  112  117
16          -    -    -    -    -    -    -    -    -    -    -  118

Table  I  gives  the  average  score  for  each  age-grade  group,  having  at 
least  20  children.  Averages  are  not  included  where  there  are  less  than  20 
children in a group. In 3B for age 7, the 37 means that 37 is the average
score for at least 20 children seven years old found in 3B. For age
eight the 36 means the average score for at least 20 children eight years
old found in 3B. The rest of the table is read in the same way. The
last  column  at  the  right-hand  edge  of  the  table  gives  the  average  score 
of  all  children  for  each  age  of  the  entire  city.  The  average  for  seven- 
year-old  children  is  40,  for  eight-year-old  children  is  44,  etc.,  for  each 
age  down  the  column. 

The  numbers  across  the  bottom  of  the  table  are  the  average  scores 
for  each  grade.  One  interesting  thing  revealed  by  this  table  is  that 
within each grade the average score decreases regularly with increasing
age, i.e. there is an inverse ratio between score and age. That holds
true  with  but  one  exception  up  to  age  15.  This  constant  decrease  in 
score with increasing age within every grade is suggestive of the working
of some definite principle which cannot safely be disregarded in
efforts  to  group  children  scientifically. 

If  the  numbers  are  read  across  the  table,  starting  with  3B  of  any 
age  thru  the  different  grades,  the  scores  increase  regularly  to  the  last 
score. This, of course, is the working of the same principle just referred
to.

Table  II  is  constructed  from  Table  I  by  dividing  the  average  score 
for  each  age  into  the  average  score  of  each  grade  group  within  that  age, 
so that the 93 for seven-year-old children in 3B means that they did 93
per  cent  of  the  city  average  for  seven-year-old  children.  The  eleven- 
year-old  children  in  3B  made  an  average  score  of  only  30  per  cent  of 
the  average  score  for  all  eleven-year-old  children.  The  percentage  de- 
creases consistently  with  increasing  age  within  any  given  grade  up  to 
age  fifteen. 
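
The arithmetic behind Table II can be checked with a short sketch. The two pairs of figures are taken from Table I (37 against the seven-year average of 40, and 25 against the eleven-year average of 83); the function name is hypothetical, and rounding halves upward is an assumption chosen because it reproduces the 93 quoted above.

```python
import math


def percent_of_age_average(group_average, age_average):
    """Express an age-grade group average as a per cent of the age average.

    Halves are rounded upward; this rounding rule is an assumption.
    """
    return math.floor(100 * group_average / age_average + 0.5)


print(percent_of_age_average(37, 40))  # seven-year-old children in 3B
print(percent_of_age_average(25, 83))  # eleven-year-old children in 3B
```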

The  averages  for  years  fourteen  and  fifteen  are  not  fully  compa- 
rable with  those  for  the  other  ages  because  only  elementary  school  chil- 
dren are  included.  Many  children  fourteen  and  fifteen  years  of  age 
were  in  high  school.  Thus  the  brightest  fourteen  and  fifteen-year-old 
children  are  not  included  as  probably  only  the  duller  ones  had  remained 
in  the  elementary  schools. 

This  inverse  ratio  between  age  and  performance  level  within  any 
given  grade  must    be   reckoned   with   in  the  grading  of  children.     It  has 


Table III. Percentile Ranks of Scores on Indiana University Mental Survey
Tests, Schedule E, Cross-Out Test. Jan. 1, 1923.


Score 


0. 

1. 

2. 

3 

4 

5. 

(i. 

7. 

8. 

9. 
10 
11 
12. 
13. 
14 
15. 
16. 
17. 
18. 
19. 
20. 
21. 
22 
23. 
24. 
25. 
26. 
27. 
28. 
29. 
30. 
31. 
32. 
33. 
34 
35. 
36. 
37. 
:;s 
39. 
40. 
41. 
42. 
43. 
44. 
45. 
46. 
47. 
48. 
49. 
50. 


Ages 

9   10 

11  12  13 

'l 

'.'.'.'.    "'.3.'.'.'. 

...   .1 

3  .... 

1 

3  .... 

...   .1 

5  .... 

.2   .1 

.1   .5  .... 

.5   .1 

.1   .5  .  .  .  . 

.7   .1 

.3  .1  .... 

.7   1 

.5  .1  .... 

.1   .1 

.8   .8.... 

1   .3 

.8   .8.... 

1   .5 

.8   .8.... 

1   .6 

.8   .8.... 

2   .6 

1   .8  .... 

3   1 

1   .8  .... 

4   1 

1   1   .2 

4   1 

1   1   .2 

5   2 

1   1   .2 

6   3 

1   1   .4 

8   4 

1   1   .6 

10   5 

3   1   .8 

12   6 

4   2   1 

15   7 

4   2   1 

17   8 

4   3   2 

19   10 

5   4   2 

21   12 

6   4   2 

23   13 

7   5   3 

25   15 

9   6   3 

30   10 

11   6   4 

33   19 

12   7   4 

37   20 

13   8   5 

40  23 

16  10   6 

44  26 

17   12   8 

48  30 

19   14   9 

52  35 

20   16   10 

55  38 

24   18   12 

;»s  1 1 

26   20   13 

61   44 

29   22   14 

64  48 

32   25   16 

67  51 

35   27   18 

71   54 

38  31   20 

73  57 

42  33   23 

76   liO 

45  35   25 

78   64 

48  38  28 

SO  (17 

52  40  30 

81   70 

54  43  32 

82   72 

56  46  35 

84  74 

59  48  37 

Score 


51. 

.-,2 

53. 

54. 

55. 

56. 

57. 

58. 

59. 

60. 

61 

62. 

63. 

64 

115 

66. 

67. 

68. 

69. 

70. 

71. 

72. 

73. 

74. 

75. 

76. 

77. 

78. 

79. 

80. 

81. 

82. 

83. 

84. 

85. 

86. 

87. 

88. 

89. 

90. 

91. 

92 

93. 

94. 

95 

'.If, 
!)7 
98. 
99. 
100 


Total    number 
of  cases 


Ages 


'.) 


S5 
87 

ss 

89 
90 
!)() 
91 
93 
94 
94 
95 
95 
95 
95 
96 
97 
97 
97 
98 
98 
98 
98 
98 
99 
99 
99 
99 
99 
99 
100 


Hli, 


10   11   12   13 


77 
80 
83 
85 
86 
87 
89 
91 
92 
93 
95 
95 
95 
95 
96 
97 
97 
97 
98 
99 
99 
99 
99 
99 
99 
99 
99 
100 


1112 


til' 
65 
07 
70 
73 
75 
77 
80 
82 
84 
85 
87 
88 
89 
91 
92 
93 
94 
95 
96 
96 
97 
98 
98 
98 
98 
98 
99 
99 
99 
99 
99 
99 
99 
99 
99 
100 


t;oi 


52 
56 

58 

60 
63 
66 
69 

72 
74 
76 
77 
80 
82 
84 
86 
88 
88 
89 
91 
92 
93 
94 
95 
96 
97 
97 
97 
98 
98 
99 
99 
99 
99 
99 
99 
99 
99 
99 
99 
')'.  i 
99 
99 
'I' i 
99 
99 
100 


595 


39 
42 
44 

47 
oil 
.12 
56 
58 
60 
62 
66 
69 
72 
74 
77 
79 
81 
82 
84 
85 
86 
89 
92 
94 
95 
96 
97 
97 
98 
99 
99 
99 
99 
100 


514 



been  a  common  practice  to  examine  all  the  children  of  a  given  grade  and 
then  group  them  into  classes  on  the  basis  of  uninterpreted  raw  scores.  In 
the  light  of  these  data,  it  would  seem  that  such  a  procedure  of  grouping 
may  nearly  if  not  entirely  defeat  its  own  purpose. 

Such  a  situation  could  easily  occur  and  in  fact  does  occur  whenever 
sections of a given grade of a large school are formed by arranging the
children  in  the  order  of  the  ranks  of  their  scores  and  then  assigning 
them  to  sections  on  the  basis  of  these  rank  orders.  The  inevitableness 
of  this  can  be  shown  best  by  analysis  of  some  actual  conditions.  Last 
November  thru  the  courtesy  of  Mr.  II.  N.  Tirey,  superintendent  of  the 
Bloomington  schools,  and  his  co-workers,  and  with  the  assistance  of 
my  class  in  Mental  Measurements,  all  the  children  of  the  Bloomington 
public  schools  from  the  first  grade  up  to  and  including  the  freshmen  of 
the  high  school  were  tested.  From  the  third  grade  up  the  children  were 
tested   by  use  of  the  Indiana   University  Mental   Survey  Test,   Schedule 


Table IV. Age and Percentile Ranks of 30 Children from Grade 5B,
Who Made the Same Score on a Mental Test

Age    Number of Children    Percentile Rank
8               1                   ?
9               4                  67
10             12                  51
11              7                  35
12              2                  27
13              2                  18
14              2                   ?

E,  the  Cross-Out  Test.  The  results  obtained  in  this  survey  were  com- 
bined with  those  reported  from  other  cities,  and  a  percentile  table  was 
constructed  for  each  age  from  nine  to  thirteen  inclusive.  These  were 
the  only  ages  within  which  percentiles  could  be  determined  with  a  fair 
degree of reliability. The highest possible score on the Cross-Out Test
is  100.  The  percentile  ranks  for  this  test  are  given  in  Table  III.  The 
scores  are  placed  along  the  left  edge  of  the  table  outside  the  vertical 
line.  The  numbers  in  the  body  of  the  table  indicate  the  percentile  rank 
of  each  score  for  each  age.  Under  age  nine  is  placed  the  percentile 
rank  of  each  score  made  by  nine-year-old  children.  The  percentile  ranks 
for  the  other  years  are  placed  in  their  respective  columns.  A  score  of 
25  has  a  percentile  rank  of  15  when  earned  by  a  nine-year-old  child,  a 
percentile  rank  of  7  when  earned  by  a  ten-year-old  child,  of  4  for  an 
eleven-year-old,  of  2  for  a  twelve-year-old,  and  of  only  1  for  a  thirteen- 
year-old  child.  The  percentile  rank  of  15  assigned  to  the  score  25  for 
a  nine-year-old  means  that  14  per  cent  of  nine-year-old  children  make 
lower  scores,  and  that  85  per  cent  of  nine-year-old  children  make  higher 
scores. When a thirteen-year-old child makes a score of 25, his percentile
rank of 1 means that less than 1 per cent of thirteen-year-old
children  make  lower  scores  and  that  99  per  cent  make  higher  scores. 
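
The reading of a percentile rank described above can be put in a short sketch. The ranks for a score of 25 are those quoted in the preceding paragraph from Table III; the function name is hypothetical, and treating "less than 1 per cent" as 0 is a simplification of the sketch.

```python
# Percentile rank of a score of 25 at each age, as quoted from Table III.
RANK_OF_SCORE_25 = {9: 15, 10: 7, 11: 4, 12: 2, 13: 1}


def lower_and_higher(percentile_rank):
    """Per cent of same-age children making lower and higher scores."""
    return percentile_rank - 1, 100 - percentile_rank


for age, rank in RANK_OF_SCORE_25.items():
    lower, higher = lower_and_higher(rank)
    print(f"age {age}: percentile {rank}, {lower}% lower, {higher}% higher")
```

The same raw score of 25 thus places a nine-year-old above 14 per cent of his age group and a thirteen-year-old above practically none of his.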

In  the  Bloomington  survey  last  November,  30  children  from  grade 
5B  made  scores  between  40  and  44.  As  42  is  the  average  score  of  this 
interval  and  all  of  the  30  children  lie  within  two  score  points  of  it,  it 
will  simplify  this  discussion  to  assume  that  all  30  made  the  same  score 
of  42.  The  distribution  of  these  children  according  to  age  and  per- 
centile ranks  is  given  in  Table  IV.  The  percentile  ranks  for  the  children 
eight  and  fourteen  years  old  could  not  be  determined,  but  that  the  eight- 
year-old  would  have  a  high  rank  and  the  fourteen-year-old  a  low  rank 
cannot  be  doubted.  It  is  not  at  all  likely  that  these  30  children  put  into 
one  class  would  constitute  a  very  satisfactory  class.  They  were  caught 
by  the  test  just  at  the  time  their  performance  levels  happened  to  be  the 
same.  This  will  never  happen  again  for  these  children.  Yet  they  are 
the  children  that  would  be  placed  together  under  one  teacher  if  the 
classes  were  selected  on  the  basis  of  the  rank  order  arrangement  of 
scores  of  all  children  within  the  grade.  A  group  of  30  children  whose 
names  had  been  drawn  from  a  hat  containing  a  shuffled  list  of  all  the 
children  of  the  5B  grade  would  probably  constitute  about  as  satisfac- 
tory a  class. 



Table  V. — Percentile  Limits  of  Each  Age  Found  in  the  Various  Grades. 

Bloomington   Survey,  Nov.,   1922 


Grades

6 

7 

(ill 
100 

i:, 
LOO 

10- 
100 

30 

100 

10- 
100 

10 

100 

9 

10 

1 1 

95 
L00 

95 
100 

70 
100 

ill) 
100 

20 
LOO 

20 
89 

•") 
74 

5- 
59 

0 
29 

0 
24 

0- 

1  1 

12 

85 

100 

75 

100 

CO 
100 

30 

1(10 

20 

10(1 

."» 
94 

0 

64 

0- 

19 

(l 
44 

0- 
4 

0- 

1  1 

o 

4 

13 

8A 

40- 

100 

si; 
7\ 

on 
100 

70 
100 

15 

100 

15 
100 

15 

100 

5 
89 

0 

84 

0 

II 

0 
39 

30 

100 

20 
100 

7B 
6A 
6B 
5  A 
5B 
l\ 

95 
100 

80 
100 

10 

100 

25 
94 

20 
94 

5- 

(19 

:. 

59 

0- 
54 

0- 
39 

0 

1 1 

IB 

5 
94 

0 
89 

0 
59 

0 
19 

3A 

3B 

2  A 

90 

94 

60 

10(1 

10- 

so 

0- 

4 

2B 

0- 
94 

0 
64 

1  \ 

5- 

89 

IB 

0- 
100 

0- 
94 

0- 

49 

Table VI. Number of Cases Included in Table V.


Grade 

6 

7 

8 

9 

10 

11 

12 

13 

8A                          

1 
1 

6 

15 
23 

47 
28 
27 
16 
17 
13 
2 

5 

1 

33 

8B 

35 

7  \ 

27 

7B 

4 
4 
18 
21 
55 
29 
33 
21 
18 

L5 
25 

47 
32 
26 
25 
18 
15 
2 

25 

6  V 

22 

6B 

2 
I 
17 
30 
59 
35 
47 

1!) 

5  \ 

13 

5B                           

12 

4A                           

4 

4B 

7 

3A                   

25 
87 
36 
42 
9 
0 

3B                   

1 

6 
'-'7 
87 
43 
43 

1 

2A 

2B 

7 

15 

150 

1  \ 

IB 

After  the  Bloomington  school  children  had  been  tested,  the  per- 
centile rank  was  computed  for  each  child  and  distribution  tables  were 
made  for  the  city.  Table  V  gives  the  lowest  and  highest  percentile  rank 
for  the  children  of  each  age  found  in  each  grade.  The  numbers  across 
the  top  of  the  table  indicate  the  ages,  those  along  the  left  side  the  grades, 
and  those  within  the  body  of  the  table  the  percentiles.  The  percentiles 
go  in  pairs  separated  by  a  hyphen.  The  first  one  of  each  pair  is  the 
lowest  percentile  rank  of  any  child  within  the  age-grade  group  at  the 
intersection  of  which  it  is  located.  The  second  of  each  pair  is  the 
highest  percentile  rank  of  any  child  within  that  age-grade  group.  Writ- 
ing the  minimum  percentile  in  the  upper  left  corner,  and  the  maximum 
percentile  in  the  lower  right  corner  of  the  rectangular  intersections  of 
the  age  columns  and  grade  rows,  facilitates  reading  of  either  the  mini- 
mum or  maximum  percentiles  as  they  vary  from  grade  to  grade  within 
each  age  column,  or  as  they  vary  from  age  to  age  within  each  grade 
row.  There  were  a  few  children  under  nine  years  of  age  in  grades 
above  the  third,  and  there  were  also  a  number  of  children  over  eight 
years  of  age  in  grades  below  the  third,  but  percentiles  could  not  be  de- 
termined for  them  because  the  mistake  was  made  of  testing  the  children 
by  grades  instead  of  by  ages.  One  of  the  most  significant  features  of 
this  table  is  the  consistent  increase  in  size  of  percentile  rank  from  the 
lowest  to  the  highest  grade  within  each  age.  In  this  connection  it  should 
be  remembered  that  school  grades  are  merely  standardized  educational 
performance  levels.  Some  of  the  interesting  facts  revealed  are  such  as 
these:  that  all  eleven-year-old  children  with  percentile  ranks  of  90  or 
more  are  found  above  the  fifth  grade;  that  no  eleven-year-old  child  with 
a percentile lower than 20 is found above the low fifth grade; that
whereas there is no nine-year-old child below the fortieth percentile in
the fifth grade, there is no thirteen-year-old child above the fortieth
percentile in this same grade. Situations like the one mentioned last
may  be  explained  by  application  of  principles  of  human  development 
suggested  earlier  in  this  discussion,  that  the  particular  performance 
level  an  individual  earns  has  of  itself  little  value  for  purposes  of  pre- 
diction, and  that  the  most  important  fact  for  purposes  of  prediction  is 
the  length  of  time  a  child  is  exposed  to  the  environment  before  he 
reaches  that  performance  level.  Thus  low  percentile  thirteen-year-old 
children  and  high  percentile  nine-year-old  children  are  found  function- 
ing at  the  same  educational  performance  level,  because  a  given  environ- 
ment produces  as  much  of  an  effect  upon  a  high  percentile  child  in  nine 
years  as  it  produces  upon  a  low  percentile  child  in  thirteen  years. 
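
The way an entry of Table V is formed, the lowest and highest percentile rank within each age-grade group, can be sketched as follows. The four records are invented for illustration and the function name is hypothetical; only the pairing of minimum and maximum follows the description of the table.

```python
def percentile_limits(children):
    """Lowest and highest percentile rank for each (grade, age) group.

    `children` is an iterable of (grade, age, percentile_rank) records.
    """
    limits = {}
    for grade, age, rank in children:
        low, high = limits.get((grade, age), (rank, rank))
        limits[(grade, age)] = (min(low, rank), max(high, rank))
    return limits


# Hypothetical records, not taken from the Bloomington survey.
records = [("5B", 9, 67), ("5B", 9, 95), ("5B", 13, 18), ("5B", 13, 5)]
limits = percentile_limits(records)
print(limits[("5B", 9)], limits[("5B", 13)])
```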

Table  VI  gives  the  number  of  children  of  each  age-grade  group  re- 
ported in  Table  V. 

SUMMARY 

The  detailed,  rather  theoretical  discussion  in  the  first  part  of  this 
paper  is  the  outgrowth  of  the  author's  efforts  to  interpret  facts  like 
those  met  with  in  the  surveys  reported  in  the  latter  part  of  the  paper. 
This  summary  is  a  re-statement  of  the  main  points  brought  out  by  the 
discussion  and  the  data  of  this  paper. 

1.  For  practical  purposes  it  may  be  assumed  that  all  children  of 
a  given  community,  and  possibly  of  a  given  state,  or  even  entire  country, 
are  subject  to  environments  fundamentally  so  similar  that  they  may  be 
considered  of  equal  value  in  stimulating  development. 

2.  It  may  be  assumed  that  when  children  of  the  same  environment 
and  of  the  same  exposure  time,  i.e.  same  age,  have  markedly  different 
performance  levels,  these  differences  constitute  the  best  available  index 
of  their  developability. 

3.  Score  on  a  mental  test  is  merely  a  numerical  credit  value  of  an 
individual's  performance  on  that  test  at  that  time. 

4.  Score,  i.e.  performance  level  on  a  given  test,  has  no  predictive 
value  except  as  interpreted  by  giving  due  consideration  to  the  specific 
nature  of  the  past  environment  to  which  the  child  has  been  subjected 
and  to  the  length  of  time  he  has  been  exposed  to  that  environment.  If 
children  have  fundamentally  the  same  environment,  age  is  all  that  need 
be  considered. 

5.  A  rank  order  arrangement  of  scores  or  performance  levels  of 
children  of  exactly  the  same  age  indicates  the  relative  effectiveness  of 
their  past  environmental  influences  and  probably  is  the  most  reliable 
index  available  of  the  effect  that  their  future  environment  is  likely  to 
have  upon  them. 

6.  The  percentile  arrangement  of  scores  within  each  age  provides 
a  very  satisfactory  method  of  reliably  locating  the  relative  position  of 
each  child's  performance  level  with  respect  to  that  of  all  children 
having  the  same  environmental  influences.  It  is  more  convenient  and 
more  useful  than  any  other  method  of  presenting  these  facts.  It  has 
the  added  advantage  of  preserving  the  facts  at  their  face  value.  It  is 
not  a  theoretical  manipulation  of  data.  It  is  just  a  rank  order  arrange- 
ment with  100  ranks. 



7.  Percentile  ranks  have  their  greatest  value  when  determined 
from  a  very  large  group  of  children  for  each  age.  Mental  test  standards 
presented  in  the  form  of  a  percentile  table,  like  Table  III,  enable  every 
user  of  the  test  to  compute,  directly  from  this  table,  the  standard  per- 
centile rank  of  the  scores  made  by  his  children.  This  permits  him  to 
compare  his  children  with  children  from  other  cities  on  a  standard  basis 
just  as  easily  as  he  compares  his  children  with  each  other. 

8.  Tests  should  be  given  to  children  by  ages  and  not  by  grades.  In 
no  other  way  is  it  possible  to  secure  reliable  percentiles  on  the  perform- 
ance of  all  children. 

9.  Scientific  classification  of  children  demands  that  each  class 
should  be  made  up  of  children  who  are  of  exactly  the  same  age  and  who 
have  the  same  level  of  performance.  All  children  within  a  given  school 
grade  presumably  have  the  same  educational  performance  level,  but  they 
do  not  all  have  the  same  developability.  Developability  cannot  be  meas- 
ured; it  can  only  be  inferred.  Mental  tests  are  a  convenient  aid  in  es- 
tablishing indices  of  developability.  Developability  cannot,  however,  be 
inferred  with  certainty  from  a  child's  performance  on  only  one  mental 
test.  If  results  from  only  one  mental  test  are  available  and  it  is  de- 
sired to  use  these  in  reclassifying  a  school,  the  following  is  a  suggested 
procedure : 

First. Determine from a standard percentile table the percentile ranks of all children.
Second. Make a list of the children in each grade. With each name give the child's exact age in years and months, and his standard percentile rank.
Third. Put together in classes those children from each grade who are of the same age and who have the same percentile ranks. This is seldom possible. It should, however, serve as a guide and, as an ideal, should be approximated as nearly as conditions permit.
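
The three steps above can be sketched in code. This is only an illustration under stated assumptions: the percentile table, the children's names, ages, and scores are all hypothetical, and a single table is used here although in practice a separate table would be needed for each age.

```python
from bisect import bisect_right
from collections import defaultdict

# Hypothetical percentile table: (lowest raw score, percentile rank).
# Illustrative values only; they are not taken from Table III.
PERCENTILE_TABLE = [(0, 1), (10, 25), (20, 50), (30, 75), (40, 99)]


def percentile_rank(score, table=PERCENTILE_TABLE):
    """First step: look up the rank of the highest entry not exceeding the score."""
    thresholds = [lowest for lowest, _ in table]
    index = bisect_right(thresholds, score) - 1
    return table[max(index, 0)][1]


def classify(children, table=PERCENTILE_TABLE):
    """Second and third steps: list each child with age and rank, then group
    together those who share both age (in months) and percentile rank."""
    classes = defaultdict(list)
    for name, age_months, score in children:
        classes[(age_months, percentile_rank(score, table))].append(name)
    return dict(classes)


# Hypothetical pupils: (name, age in months, raw score).
children = [("Ann", 120, 22), ("Ben", 120, 25), ("Cal", 126, 31)]
groups = classify(children)
```

Exact agreement in both age and rank is seldom possible, as the text notes, so in use the grouping key would probably be loosened, for example to bands of ages and bands of ranks.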


Improving Instruction Thru Educational Measurement


Frank W. Ballou, Superintendent of Schools, Washington, D.C.


(Illustrated  with  lantern  slides.) 

The  ultimate  purpose  of  all  educational  measurement  is  to  increase 
the  effectiveness  of  the  instruction  which  the  child  receives,  and  thereby 
to  improve  the  educational  results  which  the  child  possesses  from  that 
instruction. Every administrative agency, every special teacher or
supervisor, as well as every regular teacher, all educational equipment;
in  fact,  everything  pertaining  to  the  school  system,  is  fundamentally  for 
the  purpose  of  providing  the  instruction  which  the  child  needs  to  make 
him  individually  and  socially  efficient. 

Improvement of instruction thru educational measurement necessitates
the following of a systematic procedure consisting of several steps:
first, finding out what educational results are being obtained; second,
evaluating  those  results  by  applying  to  them  objective  standards  of 
achievement  which  have  been  established;  third,  getting  the  results  of 
this  evaluation  to  those  responsible  for  the  results  or  their  improvement; 
and  fourth,  getting  officers  and  teachers  to  attempt  improvement. 

This address is based on the methods pursued, and the results secured,
in the giving of the Courtis Standard Tests in Arithmetic in Boston,
beginning in 1912 and extending over a period of four or five years.
It  indicates,  in  my  judgment,  a  procedure  which  may  be  followed  any- 
where, under  proper  leadership,  and  in  many  subjects  other  than  arith- 
metic. 


STANDARD TESTS USED

The Courtis Standard Tests were used, of which the following are illustrations:

Addition

    Form 1    Form 2    Form 3
      927       127       339
      379       375       276
      756       935       977
      837       333       861
      924       325       757
      110       911       698
      854       554       269
      965       167       136
      344       554       322

Subtraction

    107795491    114957187    102142649
     77197029     90271798     70428369

Multiplication

    8246    8259    9237
      29      28      27

Division

    49)28420    24)6984    26)7306

MEDIAN  IS  THE  MEASURE 

"Median"  is  the  statistical  term  used  to  define  the  measurement  of 
the  central  tendency  of  a  group.  "Median  point"  means  the  middle  point 
in  a  statistical  distribution,  on  either  side  of  which  there  is  an  equal 
number  of  cases.  The  median  achievement  of  a  class  means  the  achieve- 
ment of  that  boy  or  girl  who  stands  in  the  middle  of  the  class  when  the 
pupils  of  the  class  are  arranged  in  a  straight  line  according  to  the 
achievement  of  each  individual. 
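
As a minimal sketch of this definition, the median can be found by arranging the pupils' scores in order and taking the middle one. The scores below are hypothetical; `statistics.median` is the standard-library function, which averages the two middle scores when the class has an even number of pupils.

```python
from statistics import median

# Hypothetical scores (examples right) for a class of nine pupils.
scores = [3, 7, 4, 9, 5, 6, 2, 8, 5]

# Arrange the pupils "in a straight line" according to achievement.
ranked = sorted(scores)

# With an odd number of pupils, the median is the score of the middle pupil.
middle_pupil = ranked[len(ranked) // 2]

# The library function gives the same value for this class.
class_median = median(scores)
```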

The  median  is  illustrated  by  the  following: 


[Chart No. 3. Distribution of January and April scores compared. Subtraction, rights. Horizontal axis: scores, 0 to 24. Vertical axis: per cent of children, 0 to 14.]

Solid line traces the January distribution.
Dotted line traces the April distribution.
Solid vertical line shows where the January city-wide median falls.
Dotted vertical line shows where the April city-wide median falls.
The figures along the base line indicate the number of examples right.
The figures at the left indicate the per cent of children making each score.

Two  important  factors  are  illustrated  by  this  chart;  namely,  the 
more  extended  distribution  of  the  April  scores  and  the  advancement  of 
the  April  median.  For  January  the  number  of  examples  correct  ranges 
from  no  examples  to  18  examples,  while  for  April  the  range  is  from  no 
examples  to  21  examples.  This  extension  of  the  distribution  is  a  mark 
of  increased  ineffectiveness  in  the  teaching.  The  January  median  is  4.5 
examples, the April median is 6.6 examples. This advancement of the
median   is  a  mark  of  improvement. 
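
The two quantities plotted on such a chart, the per cent of children making each score and the range of the scores, might be computed as follows. This is a sketch only: the two score lists are hypothetical and are chosen merely so that their ranges match those quoted above (0 to 18 and 0 to 21).

```python
from collections import Counter


def percent_distribution(scores):
    """Per cent of children making each score (the figures at the left of the chart)."""
    counts = Counter(scores)
    total = len(scores)
    return {score: 100 * counts[score] / total for score in sorted(counts)}


def score_range(scores):
    """Lowest and highest number of examples right."""
    return min(scores), max(scores)


# Hypothetical city-wide samples, not the Boston data.
january = [0, 2, 4, 4, 5, 5, 6, 18]
april = [0, 4, 6, 6, 7, 7, 9, 21]

january_percents = percent_distribution(january)
```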

METHODS  EMPLOYED  TO  IMPROVE  RESULTS 

Formal  reports  were  made:  (a)  to  teachers,  (b)  to  principals,  (c) 
to  assistant  superintendents  and  superintendent.  Printed  bulletins  were 
distributed, objective standards were established, and systematic practice
material was introduced. This address will deal with as many
phases  of  these  topics  as  time  will  permit. 

OBJECTIVE STANDARDS OF ACHIEVEMENT IN ARITHMETIC

                     Addition                              Subtraction
         Number of   Per Cent                  Number of   Per Cent
         Examples    of                        Examples    of
Grade    Attempted   Accuracy   Efficiency     Attempted   Accuracy   Efficiency
VIII        12          80        33.1            12          90        33.9
VII         11          80        30.1            11          90        28.5
VI          10          70        12.5 (?)        10          90        33.8
V            9          70        31.3             9          80        34.4
IV           8          70        27.5             7          80        33.7

                     Multiplication                        Division
         Number of   Per Cent                  Number of   Per Cent
         Examples    of                        Examples    of
Grade    Attempted   Accuracy   Efficiency     Attempted   Accuracy   Efficiency
VIII        11          80        32.1            11          90        39.7
VII         10          80        31.5            10          90        32.1
VI           9          80        [illegible]      8          80        43.5
V            7          70        27.5 (?)         6          70        39.9
IV           6          60        38.7 (?)         4          60 (?)    40.2



OBJECTIVE  STANDARDS  OF  ACHIEVEMENT  ESTABLISHED 

The  standard  for  each  grade  is  based  on  the  median  achievement  of 
all  the  pupils  in  that  grade  in  1915.  It  furnishes  a  definite  goal  for  each 
pupil  of  the  grade  and  one  which  he  can  reasonably  be  expected  to  reach 
with  the  right  kind  of  training.  A  teacher  should  see  that  her  class  is 
trained  to  reach  the  standard  and  little  if  any  more.  A  class  has  a 
satisfactory  record  when  its  median  achievement  approximates  the  stand- 
ard in  number  of  examples  attempted,  in  per  cent  of  accuracy,  and  in 
efficiency  (the  per  cent  of  pupils  who  reach  or  exceed  the  standard  in 
both  speed  and  accuracy). 

For  example :  The  median  which  is  taken  as  the  standard  for  Grade 
eight  shows  that  one-half  of  the  class  attempted  at  least  12  examples 
and  did  80  per  cent  of  them  correctly;  and  that  33  per  cent  of  the  class 
reached  or  exceeded  this  standard. 

The  achievement  of  a  class  in  speed  should  reach,  but  not  materially 
exceed,  the  established  standard  for  that  grade.  Achievement  in  ac- 
curacy should  at  least  reach  the  established  standard  and  may  approach 
as  nearly  100  per  cent  as  possible.  The  efficiency  of  the  class  should  at 
least  reach  the  established  standard  and  may  approach  as  nearly  50  per 
cent  as  possible. 
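
The efficiency measure defined above (the per cent of pupils who reach or exceed the standard in both speed and accuracy) can be sketched as follows. The pupil records are hypothetical; the Grade VIII addition standard of 12 examples attempted and 80 per cent accuracy is the one quoted in the text.

```python
def accuracy(attempts, rights):
    """Per cent of attempted examples that were correct."""
    return 100 * rights / attempts if attempts else 0.0


def efficiency(pupils, std_attempts, std_accuracy):
    """Per cent of pupils who reach or exceed the standard in BOTH speed and accuracy."""
    reached = sum(
        1
        for attempts, rights in pupils
        if attempts >= std_attempts and accuracy(attempts, rights) >= std_accuracy
    )
    return 100 * reached / len(pupils)


# Hypothetical class: (examples attempted, examples right) for each pupil.
pupils = [(12, 10), (13, 13), (11, 11), (12, 9), (14, 12), (9, 9)]

class_efficiency = efficiency(pupils, std_attempts=12, std_accuracy=80)
```

Note that a pupil who is perfectly accurate but slow, such as the one with 11 attempted and 11 right, does not count toward efficiency, since both conditions must be met.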

REPORTS   TO  PRINCIPALS  AND   SUPERINTENDENTS 

(a)  Statistical.  The  kind  of  a  statistical  report  which  was  sent  to 
each  principal  and  superintendent  before  the  close  of  school  in  June 
showing  the  standing  of  each  class  in  his  school  as  a  result  of  the  May 
tests  is  shown  in  the  following  illustration: 


[Table: statistical report for one school. For each of nine classes (four of Grade VIII and five of Grade VII) it gives the room and grade and, under each of Test No. 1, Addition; Test No. 2, Subtraction; Test No. 3, Multiplication; and Test No. 4, Division, the median speed, the median accuracy, and the efficiency of the class. The individual figures are illegible.]

Speed or attempts: Number of examples completed by pupils. Accuracy or per cent of rights: Per cent of examples correctly solved. Efficiency: Per cent of class who reach or exceed the established standard.




Certain facts should be noted. No teacher's name appears on the
report.

Low  scores  may  not  mean  poor  teaching;  perhaps  it  is  a  class  of 
backward  pupils. 

These  reports  are  statements  of  fact  about  which  the  mother  and 
the  teacher  can  have  no  difference  of  opinion. 

The  principal  can  either  justify  the  results  or  undertake  to  improve 
them  if  they  cannot  be  satisfactorily  explained. 

(b) Graphic. The statistical results were also furnished the principals
in graphic form, some of which will be here presented and explained.

A  commendable  school  record: 

[Chart: Department of Educational Investigation and Measurement. Report to Master and to Assistant Superintendent. School II, Grade VIII. Arithmetic. Test, May 1916; Series B; Form 3. Speed and accuracy in each process for Teacher A's class and Teacher B's class, compared with the standards.]

Teacher  A's  class  is  above  the  standard  in  speed  in  every  process, 
altho  the  number  of  attempts  never  goes  beyond  13.  This  class  works 
with an accuracy of over 90 per cent in every process.




The  record  of  Teacher  B's  class  follows  closely  that  of  Teacher  A's 
class. This class has a median achievement of 12.6 examples attempted
in  addition,  13.2  examples  in  subtraction,  11.7  examples  in  multiplica- 
tion, and  11.8  examples  in  division.  The  accuracy  also  is  above  the 
standard  in  every  instance. 

This  is  a  most  commendable  school  record.  It  should  be  contrasted 
with  that  of  School  I.  In  School  I  one  class  conforms  to  the  standard  in 
speed  and  works  with  almost  ideal  accuracy,  while  the  other  class  is  far 
below  the  standard  in  both  speed  and  accuracy.  In  School  II,  on  the 
other  hand,  both  classes  conform  to  the  standard  in  speed  and  work 
with  almost  ideal  accuracy. 

VARIATION  IN  CLASSES 

[Chart: Department of Educational Investigation and Measurement. Report to Master and to Assistant Superintendent. School I, Grade VIII. Arithmetic. Test, May 1916; Series B. Speed and accuracy in each process for Teacher A's class and Teacher B's class, compared with the standards.]

The  median  achievement  of  Teacher  A's  class  is  higher  than  the 
standard  in  both  speed  and  accuracy  in  every  process.  It  attempts  14.0 
examples  in  addition,  14.4  examples  in  subtraction,  11.8  in  multiplica- 
tion, and  12.7  in  division.  Its  per  cent  of  accuracy  is  98  in  addition  and 
subtraction,  and   100  in  multiplication  and  division. 

The  median  achievement  of  Teacher  B's  class  is  below  the  standard 
in  both  speed  and  accuracy  in  every  process.  It  attempts  10  examples 
in  addition,  11.3  in  subtraction,  9.6  in  multiplication,  and  9.3  in  division. 
Its accuracy ranges from 68 per cent in multiplication to 83 per cent in
division. 

This  graph  shows  that  there  is  a  great  difference  in  the  arithmetical 
achievements  of  the  two  eighth-grade  classes.  The  question  for  the  prin- 
cipal is,  "What  is  the  reason  for  this  in  this  particular  class?"  The 
results  indicate  unusual  conditions  and  do  not  show  whether  these  are 
due  to  poor  teaching,  with  a  failure  to  get  at  the  individuals  who  are 
responsible  for  the  low  class  median,  or  whether  they  are  due  to  some 
legitimate  cause. 
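
The comparison the principal is asked to make can be sketched as a check of each class median against the grade standard. The class medians in attempts are those reported above for School I; the Grade VIII standards in attempts (12, 12, 11, 11) are assumed from the table of objective standards and should be treated as an assumption, as should the function and variable names.

```python
# Assumed Grade VIII standards in number of examples attempted.
STANDARD_ATTEMPTS = {
    "addition": 12,
    "subtraction": 12,
    "multiplication": 11,
    "division": 11,
}

# Median attempts reported in the text for the two eighth-grade classes of School I.
CLASS_MEDIANS = {
    "Teacher A": {"addition": 14.0, "subtraction": 14.4,
                  "multiplication": 11.8, "division": 12.7},
    "Teacher B": {"addition": 10.0, "subtraction": 11.3,
                  "multiplication": 9.6, "division": 9.3},
}


def compare_to_standard(medians, standard):
    """Label each process as at or above the standard, or below it."""
    return {
        process: "at or above" if medians[process] >= standard[process] else "below"
        for process in standard
    }


report = {
    teacher: compare_to_standard(medians, STANDARD_ATTEMPTS)
    for teacher, medians in CLASS_MEDIANS.items()
}
```

Such a report only locates the difference between the classes; as the text says, it does not show whether the cause is poor teaching or some legitimate condition.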




SPLENDID  SCHOOL  RECORD 

[Chart: Department of Educational Investigation and Measurement. Report to Master and to Assistant Superintendent. Grade VIII. Arithmetic. Test, May 1916; Form 3. Speed and accuracy in each process for the classes of Teachers A, B, and C, compared with the standards.]

All three classes are above the standard in attempts in every
process,  but  in  no  case  does  a  class  attempt  more  than  14  examples. 
Every  class  works  more  accurately  than  the  standard  requires.  While 
the  standard  in  accuracy  in  addition  is  80  per  cent,  Class  A  achieves 
92  per  cent  in  that  process;  Class  B,  95  per  cent;  and  Class  C,  89  per 
cent.  The  standard  in  subtraction  is  90  per  cent,  while  each  of  these 
classes  works  with  an  accuracy  of  97  per  cent  or  higher  in  this  process. 
While  the  standard  accuracy  is  80  per  cent  in  multiplication,  86  per  cent 
is  the  lowest  median  achievement  of  these  classes.  All  three  classes 
work  with  a  median  accuracy  of  100  per  cent  in  division. 




UNSATISFACTORY    LOW    SCHOOL   RECORD 

[Chart: Department of Educational Investigation and Measurement. Report to Master and to Assistant Superintendent. School III, Grade VIII. Arithmetic. Series B; Form 3. Speed and accuracy in each process for the classes of Teachers A, B, C, and D, compared with the standards.]

Teacher  A's  class  attempts  a  median  of  only  10  examples  in  addi- 
tion, 10.3  examples  in  subtraction,  8  examples  in  multiplication,  and  6.8 
examples  in  division.  The  accuracy  of  this  class  ranges  from  62  per 
cent  in  addition  to  89  per  cent  in  subtraction. 

Teacher  B's  class  attempts  a  median  of  9.6  in  addition,  11  in  sub- 
traction, 9.1  in  multiplication,  and  8  in  division,  with  an  accuracy  of 
55  per  cent  in  addition,  83  per  cent  in  subtraction,  68  per  cent  in  multi- 
plication, and  78  per  cent  in  division. 

Teacher C's class attempts a median of 9.3 in addition, 10.3 in subtraction,
9 in multiplication, and 7 in division, with an accuracy of 59
per cent in addition, 86 per cent in subtraction, 67 per cent in multiplication,
and 84 per cent in division.

Teacher  D's  class  attempts  a  median  of  9  in  addition,  10.1  in  subtrac- 
tion, 7.7  in  multiplication,  and  7.3  in  division,  with  an  accuracy  ranging 
from  63  per  cent  in  multiplication  to  88  per  cent  in  subtraction.  All 
these  class  records  are  far  below  the  standard  for  the  grade  in  both 
speed  and  accuracy  in  every  process. 

ACCURACY   SACRIFICED  TO   SPEED 

[Chart: Department of Educational Investigation and Measurement. Report to Master and to Assistant Superintendent. Arithmetic. Test, May 1916. Speed and accuracy in each process for the classes of Teachers A, B, and C, compared with the standards.]

Teacher A's class attempts 17.5 examples in addition, 18.5 examples
in subtraction, 16 examples in multiplication, and 21 examples in division,
a record in attempts far beyond the standard. The accuracy ranges
from 91 per cent to 100 per cent.

Teacher B's class attempts 14.8 examples in addition, 16 examples
in  subtraction,  13  examples  in  multiplication,  and  15.6  examples  in 
division,  with  accuracy  above  the  standard  in  every  process. 

Teacher  C's  class  attempts  13.5  examples  in  addition,  14.8  examples 
in  subtraction,  12.8  examples  in  multiplication,  and  14.7  examples  in 
division,  with  84  per  cent  accuracy  in  addition,  94  per  cent  in  subtraction, 
84  per  cent  in  multiplication,  and  94  per  cent  in  division. 

All  three  classes  are  far  beyond  the  standard  in  speed  while  they 
work  more  accurately  than  the  average  eighth-grade  pupil. 

It  is  regrettable  that  in  this  school  there  was  enough  overtraining 
in  the  fundamentals  to  bring  the  class  medians  in  speed  so  far  above  the 
standard.  The  high  accuracy  is  to  be  commended,  for  accuracy  of  100 
per  cent  is  the  ideal  to  be  striven  for;  the  standard  represents  what  we 
may  reasonably  expect  every  class  to  achieve.  If  the  attempts  were 
nearer  the  standard,  this  would  be  an  example  of  the  kind  of  school 
record  that  is  desirable. 


PUPILS'   RECORDS   FOR   TEACHERS 

The    record   of  each   pupil   in   each   standard   test  was  recorded  on   a 
card  which  passes  along  with  him  as  he  is  promoted  from  grade  to  grade. 


[Record card: RECORD OF ARITHMETIC TESTS, Boston Public Schools. Spaces for Name, (Boy or Girl), Age at first trial (on last birthday), School District, School, and City. Columns for Grade and Date and, under Series B, for Test 1, Add.; Test 2, Sub.; Test 3, Mult.; and Test 4, Div., each including a column for Attempts. Rows for the 1st through the 7th Trial.]

[Graph: the present eighth-grade standard in the four fundamental processes, shown in relation to the ideal.]

The card indicates to the teacher the pupil's ability or inability to
perform  the  work  in  the  four  fundamental  processes,  so  that  she  knows 
his  needs  and  can  govern  her  work  accordingly. 

The  above  graph  shows  the  objective  standard  of  achievement  in  the 
four  fundamental  processes  in  arithmetic  for  an  elementary  school  grad- 
uate, based  on  what  50  per  cent  of  the  eighth-grade  children  in  Boston 
have  shown  they  are  able  to  do.  Therefore,  it  indicates  a  reasonable 
accomplishment,  and  one  presumably  within  the  grasp  of  a  large  pro- 
portion of  eighth-grade  pupils. 

This  standard  is  not  the  ideal,  however.  An  ideal  represents  some- 
thing striven  for  but  not  usually  attained.  In  the  case  of  the  number 
of  examples  attempted  the  standard  may  well  be  considered  the  ideal. 
As  to  the  degree  of  accuracy,  altho  a  standard  of  80  per  cent  has  been 
set  up  as  what  may  reasonably  be  expected,  the  ideal  of  accuracy  is 
always  to  be  100  per  cent.  The  graph  shows  the  present  standard  in 
relation  to  the  ideal.  A  record  which  is  near  the  standard  and  which 
shows  uniform  ability  as  to  the  amount  of  work  done,  and  also  as  to 
accuracy,  approaches  the  ideal. 

UNIFORM ABILITY IN OPERATIONS

It  is  desirable  that  a  person  should  be  equally  proficient  in  the  four 
fundamental  processes  of  addition,  subtraction,  multiplication,  and 
division.  If  he  is,  he  will  be  able  to  attempt  the  same  number  of  exam- 
ples in  each  process  in  the  Courtis  Standard  Tests  and  will  work  as 
accurately  in  one  process  as  in  another.  As  a  matter  of  fact,  however, 
too few pupils approximate to this achievement. The following picture
shows  the  record  of  a  child  whose  achievements  approach  the  ideal  in 
this  respect. 


[Picture: record card of a pupil whose achievements approach uniform ability in the four processes.]

When she first took the test she showed uneven ability in the processes.
She was slow in addition as well as inaccurate. She was accurate
in subtraction but slow. Her multiplication score was up to
eighth-grade standard in the number of examples correct, while the
division score was up to the eighth-grade standard in speed and showed
perfect accuracy. In the second test she showed remarkable uniformity
in the four processes, attempting 12 examples in every process but addition,
in which she attempted 13, and in every case getting 12 examples
correct.

The second score is an unusual one and shows an ideal toward which
to  work.  While  the  number  of  examples  attempted  is  at  or  near  the 
standard,  there  is  at  the  same  time  almost  perfect  accuracy.  Just  enough 
emphasis  has  been  placed  on  the  deficiencies  in  addition  and  subtraction 
to  eradicate  them  and  bring  ability  in  those  processes  up  to  the  ability 
in  multiplication  and  division.  The  record  of  this  girl  is  the  kind  of 
standard  to  which  to  look  forward.  It  would  constitute  a  better  standard 
than  the  present  one. 
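
Uniformity of ability across the four processes can be expressed, as a rough sketch, by the spread between the largest and smallest figure on the card. The second-test figures below are those given in the text (13 attempted in addition, 12 in the other processes, 12 right in each); the use of the spread as the measure is an assumption made for illustration and is not part of the Courtis tests.

```python
# Second-test record from the text: process -> (examples attempted, examples right).
second_test = {
    "addition": (13, 12),
    "subtraction": (12, 12),
    "multiplication": (12, 12),
    "division": (12, 12),
}


def spread(values):
    """Difference between the largest and smallest value; 0 means perfectly uniform."""
    values = list(values)
    return max(values) - min(values)


attempt_spread = spread(attempts for attempts, _ in second_test.values())
rights_spread = spread(rights for _, rights in second_test.values())

# Accuracy in each process, as per cent of attempted examples that were right.
accuracies = {
    process: round(100 * rights / attempts, 1)
    for process, (attempts, rights) in second_test.items()
}
```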

VARYING  ABILITY  IN  OPERATIONS 

In  contrast  to  the  foregoing,  the  following  is  an  example  of  ex- 
treme variability  in  ability  to  add,  subtract,  multiply,  and  divide. 


[Picture: record card of a pupil showing varying ability in the four processes.]


The record shows varying ability in the four fundamentals. The
addition  score  is  10  examples  attempted  with  only  4  right,  a  record 
below  the  eighth-grade  standard  both  in  speed  and  accuracy;  the  multi- 
plication score  is  11  examples  attempted  with  only  5  right,  a  record  up 
to  the  eighth-grade  standard  in  speed  but  far  below  it  in  accuracy.  The 
subtraction  and  division  scores  are  above  the  eighth-grade  standard.  In 
both  these  processes  the  girl  attempts  12  examples  and  performs  the 
work  with  perfect  accuracy. 

This  card  shows  clearly  where  drill  is  needed.  If  this  child  had 
taken  the  tests  in  earlier  years,  the  weakness  in  addition  and  multipli- 
cation would  have  shown  in  the  previous  records,  thus  making  it 
possible  for  the  teacher  to  correct  it.  As  it  is,  this  girl  graduates  from 
an  elementary  school  with  no  greater  ability  to  do  arithmetical  problems 
than  that  represented  in  her  addition  and  multiplication  scores.  Only 
individual  drill  on  addition  and  multiplication  will  help  this  girl. 
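
The way such a card shows where drill is needed can be sketched by checking each process against the standard in both speed and accuracy. The pupil's figures are those given in the text; the Grade VIII standards are assumed from the table of objective standards, and the function name is hypothetical.

```python
# Assumed Grade VIII standards: process -> (examples attempted, per cent of accuracy).
STANDARD = {
    "addition": (12, 80),
    "subtraction": (12, 90),
    "multiplication": (11, 80),
    "division": (11, 90),
}

# Record described in the text: process -> (examples attempted, examples right).
record = {
    "addition": (10, 4),
    "subtraction": (12, 12),
    "multiplication": (11, 5),
    "division": (12, 12),
}


def needs_drill(record, standard):
    """Return the processes in which the pupil falls below the standard
    in speed, in accuracy, or in both."""
    weak = []
    for process, (attempts, rights) in record.items():
        std_attempts, std_accuracy = standard[process]
        accuracy = 100 * rights / attempts if attempts else 0.0
        if attempts < std_attempts or accuracy < std_accuracy:
            weak.append(process)
    return weak


drill_list = needs_drill(record, STANDARD)
```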


ERADICATION  OF   A   DEFICIENCY 

When  a  pupil  finds  difficulty  in  performing  examples  in  any  one  of 
the  four  processes,  he  needs  special  help  in  order  to  get  at  the  root  of  his 
trouble.  The  teacher  must  first  find  out  what  his  trouble  is  and  then 
apply  the  remedy.  No  amount  of  general  class  drill  will  eradicate  his 
particular  deficiency.  Each  pupil  must  be  given  the  opportunity  to  prac- 
tice on  the  process  with   which  he  finds  most  difficulty. 

The  following  is  a  record  of  a  pupil  whose  teachers  helped  him  to 
overcome  difficulties: 


[Picture: record card of a pupil whose teachers helped him to overcome difficulties, showing three successive tests.]

By  examining  the  record  of  this  boy  it  will  be  seen  that  there  was 
constant  improvement  in  both  speed  and  accuracy  in  every  test.  In  the 
first  test  the  record  shows  particularly  slow  and  inaccurate  work  in 
addition  and  division.  In  addition,  4  examples  were  attempted  with  only 
2 right, while in division only 4 were attempted with 0 right. While
the  subtraction  and  multiplication  scores  were  not  so  bad  as  these,  they 
were  far  below  the  standards  for  the  grade.  In  the  second  test  the 
record shows improvement in every process. That in division was particularly
good while that in addition was only slight, for the work in
this  process  was  still  very  inaccurate.  In  the  third  test  perfect  accuracy 
has  been  attained  in  every  process,  altho  with  the  exception  of  division 
the  speed  is  not  quite  up  to  the  eighth-grade  standard.  The  graph  on 
the  following  page  shows  how  the  uneven  line  of  achievement  has  been 
gradually  straightened  in  the  three  successive  tests. 

This  record  proves  the  value  of  its  existence.  As  a  result  of  the 
first  test,  the  individual's  troubles  in  the  fundamentals  were  diagnosed 
and  treatment  applied  where  it  was  most  needed;  i.e.  in  division.  The 
second  test  showed  the  efficacy  of  this  treatment.  Addition,  however, 
still  made  a  poor  showing.  The  third  test  showed  that  proper  treatment 
had  been  applied,  for  this  deficiency  was  overcome.  The  speed  still  was 
below  the  standard.  However,  perfect  accuracy  was  reached  and  but 
little  extra  drill  on  addition  and  multiplication  would  be  needed  to  make 
of  this  an  ideal  score. 

PERSISTENCE  OF   A  DEFICIENCY 

In contrast to the foregoing, the following picture shows a record
of  a  pupil  whose  teachers  have  failed  to  help  her  to  overcome  her  dif- 
ficulties: 


[Picture: record card of a pupil whose deficiency in addition persisted through three successive tests.]


The  girl's  first  test  showed  that  at  the  beginning  of  the  seventh  year 
her  subtraction  and  multiplication  scores  were  already  up  to  seventh- 
grade  standards;  her  division  score,  altho  up  to  seventh-grade  standard 
in  speed,  was  rather  inaccurate;  in  addition  the  record  showed  that  she 
was  slower  and  more  inaccurate  than  the  average  fifth-grade  child.  At 
the  end  of  that  year,  her  second  test  score  showed  that  she  had  increased 
her  speed  in  every  process  altho  her  accuracy  had  not  improved;  in 
fact,  in  multiplication  she  was  more  inaccurate  than  in  the  first  test. 
At the end of the eighth year the third test showed work that was at or
above  the  eighth-grade  standard  in  all  processes  but  addition.  In  this 
process  the  deficiency  was  still  apparent. 

The record showed both the seventh-grade teacher and the eighth-grade
teacher that this child was meeting some difficulty in her addition
work  that  prevented  her  doing  accurate  work.  An  effort  should  have 
been  made  to  discover  that  difficulty  and  eradicate  it. 

STRIKING    ACCURACY 
Perfect  accuracy  is  always  the  ideal  to  be  striven  for. 


[Picture: arithmetic record cards of two pupils showing striking accuracy.]


The seventh-grade record of A...... S...... in 1915 shows good accuracy
except in addition. The speed is up to or above eighth-grade standard in
every process. Her eighth-grade record in 1916 shows perfect accuracy.
There has been a slight gain in attempts except in addition, where the
accuracy is attained at the expense of speed, which in this case could
afford to suffer. The record of O...... P...... in 1915 shows good
accuracy with the attempts up to or above seventh-grade standard except
in addition. In 1916 he attempts just 12 examples in every process except
addition, where he attempts 11, and works with perfect accuracy in all
four processes.

These records show striking accuracy. The May, 1916, record of
O...... P...... is almost ideal, for altho the attempt score falls
slightly below the standard in addition, the accuracy is perfect. Just
enough drill work has been afforded to keep the attempts near the
standard. The 1916 record of A...... S...... is ideal as regards
accuracy. The change in the addition score from 21-14 to 16-16 is
especially noteworthy. The attempts, however, exceed the standard,
showing that the girl did not need so much drill as she was given in
order to maintain her speed, which was already near the standard.


ANOTHER CASE

[Record card not reproduced.]

The  first  test  showed  that  this  boy  when  in  grade  six  possessed 
ability  above  that  of  the  majority  of  eighth-grade  pupils  in  all  processes 
except  multiplication.  He  worked  with  almost  perfect  accuracy.  This 
striking  accuracy  was  repeated  in  both  the  second  and  third  tests.  There 
was  little  gain  in  attempts.  In  fact,  there  was  a  falling  off  in  the  num- 
ber of  examples  attempted  in  division.  At  the  end  of  the  eighth-grade 
year  he  was  above  the  eighth-grade  standard  in  speed  and  worked  with 
splendid  accuracy. 

This boy showed in his first test a reasonable achievement in the
four  processes;  in  fact,  a  higher  achievement  than  50  per  cent  of  the 
eighth-grade  pupils  in  the  city  attained.  There  was  no  necessity  of  his 
receiving  more  than  a  very  small  amount  of  drill  on  the  fundamentals. 
Apparently  he  received  a  very  slight  amount,  as  his  last  test  showed 
little  gain  in  ability  over  the  first  test.  This  is  as  it  should  be.  The 
energy  necessary  to  raise  this  child's  score  higher  was  being  rightfully 
applied  to  other  more  important  things. 

EXCEPTIONALLY   HIGH   AND  LOW   ABILITY 

Examples  of  two  kinds  of  records  follow:  those  of  pupils  whose 
native  ability  is  high  and  those  of  pupils  whose  native  ability  is  low. 
Naturally  children  of  the  first  type  require  much  less  training  to  bring 
them  to  a  certain  level  than  the  children  of  the  second  type  require.  The 
former  will  profit  far  more  by  a  class  drill  than  the  latter.  In  this 
difference in capacity lies the greatest reason for the need of training
suited to the individual.


[Record cards not reproduced.]

In  the  first  test  the  girl  designated  as  "A"  had  a  low  achievement 
in  every  process.  In  no  process  was  her  achievement  so  high  as  the 
fourth-grade  standard.  In  the  second  test  her  record  showed  some  in- 
crease in  the  number  of  attempts,  but,  on  the  whole,  none  in  accuracy.  In 
no  process  was  the  standard  for  grade  seven  reached.  On  the  other 
hand,  the  girl  "B"  had  a  high  achievement  in  the  first  test;  in  fact,  in 
this  test  her  achievement  exceeded  the  eighth-grade  standard  in  all 
processes  but  multiplication,  which  was  already  above  seventh-grade 
standard.  In  the  second  test  she  performed  from  5  to  11  more  examples 
in  each  process  than  she  did  in  the  first  test,  with  no  loss  in  accuracy. 

A,  who  needed  much  more  drill  work  in  this  year  than  B,  got  in  all 
probability  the  same  amount.  B,  possessing  a  high  achievement  to  begin 
with,  and  needing  practically  no  further  drill  as  it  was  evident  that 
there  were  no  serious  difficulties  for  her  to  overcome,  profited  far  more 
from  the  class  drill  than  A  possibly  could  until  her  own  individual  dif- 
ficulties were  overcome.  Thus  A  did  not  get  so  much  drill  as  she  needed, 
while  B  was  forced  to  spend,  on  drill  which  was  unnecessary  for  her,  time 
that  could  have  been  more  profitably  employed  on  some  other  work. 


[Record cards not reproduced.]

In this test all these pupils achieved a score far beyond the standard
set  for  eighth-grade  pupils.  They  succeeded  in  performing  correctly 
most  of  the  examples  which  they  attempted. 

Pupils  who  can  achieve  such  high  scores  as  those  shown  here  demon- 
strate that  their  native  ability  is  high.  Hence  they  need  a  minimum  of 
training  in  order  to  bring  to  a  desirable  height  their  ability  in  this 
particular  subject.  When  that  desired  height  has  been  reached  (in  this 
case  the  objective  standards  of  achievement  in  the  four  fundamental 
processes)  then  time  available  for  further  training  should  be  devoted  to 
other  branches  of  learning. 


[Record cards not reproduced.]

In this test C...... E...... did not reach the sixth-grade standard in
any process; no one of the other pupils whose records are shown here
succeeded in achieving in any process what 50 per cent of the children
in grade four could do, with the one exception that V...... A......
could perform in division a little more than the fourth-grade median
achievement.

Pupils  who  achieve  such  low  scores  probably  demonstrate  that  their 
native  ability  is  low.  Hence  they  are  individuals  who  certainly  need  a 
maximum  of  training  in  order  to  attain  a  satisfactory  ability.  If  one 
contrasts  these  records  with  the  foregoing  group  of  high  records,  one  can 
readily see the need of drill suited to the individual, as class drill in
the fundamentals cannot be adapted to the needs of all members of the
class. 

NOTEWORTHY   ANNUAL  GROWTH 

A  few  graphs  will  show  how  pupils  progress  satisfactorily  from  year 
to  year: 


[Graph not reproduced.]

INTERPRETATION OF THE ABOVE RECORD

This  graph  shows  the  results  in  three  successive  tests  of  a  girl  who 
was  eleven  years  old  when  she  took  the  first  test.  The  first  test  was  in 
October,  1914,  when  she  was  beginning  the  work  of  grade  seven.  The 
second  test  was  at  the  end  of  that  year,  May  17,  1915.  The  third 
test  was  in  May,  1916,  when  she  was  about  to  graduate. 

In  the  record  of  the  first  test  the  graph  shows  an  uneven  line  due 
to  the  inaccuracy  in  addition  and  division.  The  accuracy  in  subtraction 
and  multiplication  is  good,  but  the  number  of  attempts  in  each  process 
is  only  4  or  5,  which  means  that  it  never  reaches  the  achievement  of  the 
ordinary  fifth-grade  child.  In  the  second  test  an  improvement  in  speed 
and accuracy is shown. The girl attempts 5 or 6 examples in each process
and performs correctly all that she attempts. The speed is still
below the fifth-grade standard. The improvement in addition and division,
which were inaccurate before, is especially noticeable. In the
third test there is a much larger number of examples attempted in each
process.  In  this  test  the  score  conforms  rather  closely  to  the  eighth- 
grade  standard.  The  high  quality  of  accuracy  achieved  in  the  second 
test   has   suffered   somewhat. 

The  annual  growths  in  achievement  shown  by  this  card  are  very 
commendable.  In  the  seventh  grade  the  accuracy  is  improved.  There 
must  have  been  special  emphasis  placed  on  addition  and  division  to  im- 
prove those  scores  so  much.  In  the  eighth  grade  the  speed  is  improved. 
The growth attained in grade eight could hardly be better, for in this
one  year,  from  being  able  to  do  only  work  that  would  not  compare  fa- 
vorably with  the  work  of  fifth-grade  children,  this  girl  increases  her 
ability  so  that  it  almost  reaches  the  standard  set  for  eighth-grade 
children. 


[Graph not reproduced.]


INTERPRETATION  OF  THE  ABOVE  RECORD 

This graph shows the records of three successive tests taken by
A...... O'N......, the first and second in October, 1914, and May,
1915, when he was a pupil in grade seven, and the third in May, 1916,
when he was a pupil in grade eight.

In  the  first  test  this  boy  attempts  only  a  small  number  of  examples 
in  each  process.  Moreover,  his  work  in  addition  and  subtraction  is  de- 
cidedly inaccurate.  In  the  second  test  he  shows  improvement  in  both 
speed and accuracy in every process except division, where he attempts
only four examples. He is still below the standard in speed for his grade.
In the third test he has made such progress that his record is beyond
but near the eighth-grade standard.

The  growth  from  year  to  year  in  this  child's  ability  is  good.  He 
entered  grade  seven  with  arithmetical  ability  lower  than  that  of  half  the 
children  in  grade  four.  When  he  left  grade  eight,  his  ability  had  been 
increased  so  that  it  was  higher  than  that  of  half  the  children  in  grade 
eight.     It   approximated  the  eighth-grade  standard. 


SYSTEMATIC  PRACTICE  MATERIAL  INTRODUCED 

Undoubtedly,  the  most  important  means  of  improving  arithmetic  in- 
struction in  the  four  fundamental  operations  has  been  the  introduction 
of  systematic  practice  material  or  exercises  which  take  the  place  of  the 
former  general  class  drill  in  which  every  child  practiced  on  the  same  ex- 
ercise. The  Courtis  tests  have  demonstrated  conclusively  that  the  mass 
or  class  drill  exercises  in  addition,  subtraction,  multiplication,  and  di- 
vision are  ineffective  and  uneconomical.  Mass  drills  provide  unneces- 
sary practice  for  the  more  capable  third  of  the  class  and  hence  are  un- 
economical; they  do  not  provide  enough  practice  for  the  less  capable 
third  of  the  class  and  hence  are  ineffective.  An  economic  distribution 
of  drill  requires  that  it  provide  a  means  of  reaching  the  individual  needs 
of  the  40  pupils  of  the  class. 

In  an  address  before  the  Masters'  Association  on  October  13,  1914, 
the  writer  defined  the  essential  characteristics  of  satisfactory  practice 
material  in  arithmetic. 

1.  It  must  be  easily  handled  by  pupils  and  teachers  within  a  limited 
time  allotment,  say  ten  minutes. 

2.  It  must  provide  for  the  correction  of  the  problems  by  the  pupils 
and  not  by  the  teacher. 

3.  It  must  provide  for  each  child  a  kind  of  work  suited  to  his  par- 
ticular needs. 

4.  It  should  provide  the  pupil  with  a  method  by  which  he  can  keep 
a  record  of  his  daily  progress. 

5.  It  should  be  progressively  graded  so  that  the  child  may  pass  from 
one  kind  of  exercise  to  another  as  his  ability  increases,  always  finding 
the  kind  of  exercise  in  which  he  needs  practice. 

6.  It  should  provide  a  means  of  keeping  a  record  of  the  progress 
of  the  class. 




IMPROVEMENT  IN  SCHOOL  SYSTEM  AS  A  WHOLE 

A  splendid  opportunity  presented  itself  in  1915  to  measure  the  re- 
sults of  the  systematic  work  which  had  been  carried  on  in  educational 
measurement in Boston because the testing work had by that time become
city-wide, and because the schools could be easily classified as to
participation  in  the  measurement  work.  Schools  were  grouped  into 
Group  A,  Group  B,  and  Group  C. 

Group  A  schools  in  the  following  discussion  are  those  in  which  the 
tests have been given since 1912. In Group A there are 29 schools and
18,391  pupils  represented. 

Group  B  schools  are  those  which  were  added  during  the  second  year 
or  the  first  half  of  the  third  year  of  testing.  They  are  schools  which 
have  been  tested  from  one  to  two  years,  and  are  schools  in  which  the 
effects  of  the  testing  may  legitimately  vary.  In  Group  B  there  are  17 
schools  and  15,241  pupils  represented. 

Group C schools are those in which the tests were given for the first time
in  May,  1915.  They  are  schools  which  have  not  been  affected  by  the 
Courtis  testing  work  in  the  city,  except  in  so  far  as  general  discussion 
of  the  work  has  affected  individual  teachers  or  principals.  It  should  be 
said, however, that principals of these schools have had opportunity to
introduce the practice material, and several of them did so. If these
schools had not been at all affected by the system of educational measurement
in the city, the superiority of the schools tested would have been
even  greater  than  it  is.  In  Group  C  there  are  17  districts  and  11,836 
pupils  represented. 

The  comparison  of  the  results  achieved  in  1915  by  these  groups  of 
schools  is  as  follows: 


Addition

Comparison of Median Scores of Group A, Group B, and Group C Schools,
May, 1915

        Group A                   Group B                   Group C
Grade   Speed        Accuracy     Speed        Accuracy     Speed        Accuracy
VIII    14.4         79.07%       13.1         77.84%       [illegible]  76.11%
VII     12.9         77.50%       11.6         71.15%       [illegible]  71.21%
VI      11.5         78.17%       10.4         72.72%       11.1         71.11%
V       9.3          72.93%       [illegible]  [illegible]  9.1          68.04%
IV      8.2          [illegible]  7.6          [illegible]  7.7          63.11%


Subtraction

Comparison of Median Scores of Group A, Group B, and Group C Schools,
May, 1915

        Group A                   Group B                   Group C
Grade   Speed        Accuracy     Speed        Accuracy     Speed        Accuracy
VIII    11.6         89.73%       13.1         88.77%       12.1         86.92%
VII     12.9         87.57%       11.7         [illegible]  11.5         [illegible]
VI      11.5         [illegible]  10.7         85.69%       11.0         88.73%
V       9.3          85.11%       9.1          84.11%       9.0          80.64%
IV      7.9          83.63%       [illegible]  [illegible]  7.1          78.28%

Multiplication

Comparison of Median Scores of Group A, Group B, and Group C Schools,
May, 1915

        Group A                   Group B                   Group C
Grade   Speed        Accuracy     Speed        Accuracy     Speed        Accuracy
VIII    12.1         82.58%       11.2         [illegible]  10.8         [illegible]
VII     10.8         81.58%       10.3         79.28%       [illegible]  [illegible]
VI      9.5          78.15%       8.8          76.07%       9.1          [illegible]
V       7.6          75.69%       [illegible]  70.01%       7.3          70.97%
IV      6.1          [illegible]  [illegible]  [illegible]  5.8          [illegible]

Division

Comparison of Median Scores of Group A, Group B, and Group C Schools,
May, 1915

        Group A                   Group B                   Group C
Grade   Speed        Accuracy     Speed        Accuracy     Speed        Accuracy
VIII    12.7         93.16%       12.0         92.08%       11.0         89.71%
VII     10.9         [illegible]  9.6          87.63%       9.3          [illegible]
VI      8.9          [illegible]  7.9          [illegible]  8.6          86.72%
V       6.2          78.98%       6.7          80.05%       [illegible]  73.08%
IV      4.7          [illegible]  4.6          59.11%       4.8          63.11%

Undoubtedly,  the  objective  standards  of  achievement,  the  various 
reports  to  teachers,  to  principals,  and  to  the  superintendent,  the  printing 
of  bulletins  for  general  distribution,  and  the  introduction  of  scientific 
practice  material,  have  all  contributed  something  toward  the  improve- 
ment of  results  in  arithmetic.  Of  more  pronounced  effect  probably  than 
any  of  these  factors,  however,  has  been  the  stimulation  among  teachers 
of an inquiring attitude toward the whole problem of arithmetic instruction.
The results from the tests have shown that the problem of arithmetic
teaching is not yet solved; and they have prompted many teachers
to  study  their  own  work  as  the  first  step  toward  improving  methods  of 
instruction. 

These  results  seem  to  prove  two  important  propositions:  (1)  That 
the  scientific  measurement  of  educational  results  is  possible  and  prac- 
ticable; and  (2)  that  educational  measurement  may  be  a  means  of  im- 
proving those  educational  results. 




Some Dangers of Scientific Measurement to be Avoided

(Abstract)


Frank W. Ballou, Superintendent of Schools, Washington, D.C.


What I shall have to say on the dangers of scientific measurement
has  grown  out  of  my  experience  and  is  intimately  related  to  my  address 
of  Friday  afternoon  ["Improving  Instruction  Thru  Educational  Meas- 
urement"]. In  fact,  these  two  addresses  supplement  one  another.  The 
one  on  Friday  afternoon  showed  what  was  accomplished  over  a  period 
of  years,  thru  educational  measurement;  this  talk  will  concern  itself 
with  some  of  the  dangers  of  which  all  school  people  must  take  cognizance 
if  these  dangers  are  to  be  avoided. 

At  the  outset  I  want  to  apologize  for  the  dogmatic  method  which  I 
shall  follow  in  this  talk.  I  follow  that  method  because  it  makes  for 
directness, because I can thereby cover more ground, and also because
I want to close on time, so that time shall be left for discussion following
my talk. I shall discuss four dangers, the first of which is:

1.     DANGER    THAT    RESULTS    OF    MEASUREMENT     WILL 
NOT  BE  UNDERSTOOD  BY  THE  TEACHERS 

When  I  began  work  in  Boston,  the  Courtis  Standard  Tests  had  been 
given  several  times.  A  very  intelligent  teacher  stated  that  her  pupils 
had  taken  the  test  three  times,  but  she  could  see  no  change  in  them. 
This  teacher  seemed  to  feel  that  the  tests  were  like  medicine,  and  that 
the  children  should  improve  in  their  educational  work  after  each  dose. 
Teachers must be taught the purpose and use of the results of educational
tests if they are to make appropriate use of them. I need not
say  to  this  group  of  people  that  there  is  nothing  magical  about  the  tests. 
The  value  of  the  tests  depends  upon  the  intelligent  use  by  teachers  and 
school  officers  of  the  information  which  the  tests  furnish  them. 

Educational measurement has necessitated our familiarizing ourselves
with new terms and indeed a new language. The terms "median",
"mode", "intelligence quotient" (I.Q.) and many others must now become
a part of our professional language. Teachers must be familiar
with  these  terms  if  they  are  to  read  intelligently  professional  journals. 

Not  only  must  teachers  be  familiar  with  the  new  language  of  edu- 
cational measurement  but  they  must  also  be  able  to  interpret  correctly 
charts,  graphs,  and  other  means  used  to  represent  the  results  secured 
thru  educational  measurement.  I  present  herewith  some  charts  about 
which  I  shall  ask  some  questions  and  make  some  comments  to  illustrate 
the  necessity  of  teachers  being  familiar  with  such  charts. 

[Record card not reproduced.]


This  record  shows  that  this  girl  appears  to  have  made  unusual 
progress  in  drill  work  during  the  eighth  year.  The  question  is 
whether the child was not overdrilled. Only the teacher who taught
the child can answer this question. For an eighth-grade pupil to improve
in drill work from the ability represented by 6 examples attempted
and none of them right to 19 attempted and 19 right, represents great
improvement. Was this improvement secured at the sacrifice of other
studies? 


[Record card not reproduced.]

This boy, thirteen years old, who graduated from the eighth grade,
was deficient in addition and division particularly. The teachers must
be  trained  to  use  this  information  and  should  be  able  to  explain  why  the 
pupil  can  be  trained  to  do  satisfactory  work  in  subtraction,  somewhat 
less  satisfactory  work  in  multiplication,  but  be  unable  to  do  division. 


[Graph of the record of A. W. not reproduced.]


The  record  of  A.  W.  is  stated  in  graphic  form.  Teachers  must  be 
trained  to  interpret  such  graphs,  since  they  are  becoming  increasingly 
important  in  our  educational  literature. 


[Graph: DISTRIBUTION OF JANUARY AND APRIL SCORES COMPARED. SUBTRACTION
RIGHTS, GRADE V. January median, 4.5; April median, 6.6.]


The  above  graph  shows  that  the  children  of  the  fifth  grade  advanced 
from  a  median  of  4.5  to  a  median  of  6.6,  from  January  until  April. 
The  teachers  must  understand  the  significance  of  this.  Not  every  child 
made  progress.  Some  children  undoubtedly  did  less  well  in  April  than 
in  January.  The  median  represents  the  combined  progress  of  all  in- 
dividuals. 
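
The point can be checked with a small computation. The sketch below uses invented scores for seven hypothetical pupils (they are not taken from the Boston records) to show that a class median may rise while some individual pupils lose ground.

```python
from statistics import median

# Hypothetical "rights" scores for the same seven pupils, listed in the
# same order for both tests. These figures are invented for illustration.
january = [2, 3, 4, 5, 6, 7, 8]
april = [1, 5, 6, 7, 5, 9, 10]

def pupils_who_declined(before, after):
    """Return the positions of pupils whose score fell between the tests."""
    return [i for i, (b, a) in enumerate(zip(before, after)) if a < b]

print("January median:", median(january))
print("April median:", median(april))
print("Pupils who did less well in April:", pupils_who_declined(january, april))
```

With these assumed figures the median advances although two of the seven pupils do less well on the second test, which is why the median alone cannot stand for the progress of any one child.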


[Class record not reproduced.]


This  is  the  record  of  the  class  which  reaches  approximately  the 
standards  established  for  the  school  system.  It  is  to  be  noted  that  even 
tho  this  class  approximates  the  standard,  nevertheless  there  are  many 
children  who  need  special  attention.  The  median  is  a  class  measure. 
Individuals  in  that  class  must  be  considered.  For  example,  one  pupil 
attempts only 5 problems and gets less than half of them correct. Three
pupils  attempted  as  few  as  6  examples,  with  an  accuracy  of  from  60 
to  80  per  cent.  Such  cases  must  receive  consideration,  even  tho  the 
class  record  is  satisfactory. 
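
One way to keep individuals in view is to list, alongside the class median, every pupil who falls below chosen limits. The following is a minimal sketch only: the pupil labels, the scores, and the two cut-off values are assumptions made for illustration and are not figures from the class record discussed here.

```python
from statistics import median

# Hypothetical class record: pupil label -> (examples attempted, examples right).
class_record = {
    "P1": (5, 2), "P2": (6, 4), "P3": (12, 10),
    "P4": (13, 11), "P5": (12, 12), "P6": (14, 12), "P7": (11, 10),
}

MIN_ATTEMPTS = 8      # assumed cut-off for speed
MIN_ACCURACY = 70.0   # assumed cut-off for accuracy, in per cent

def accuracy(attempts, rights):
    """Per cent of attempted examples that were right."""
    return 100.0 * rights / attempts if attempts else 0.0

def needs_attention(record):
    """Return labels of pupils below either cut-off, in sorted order."""
    return sorted(
        name for name, (att, right) in record.items()
        if att < MIN_ATTEMPTS or accuracy(att, right) < MIN_ACCURACY
    )

class_median_attempts = median(att for att, _ in class_record.values())
print("Class median attempts:", class_median_attempts)
print("Pupils needing attention:", needs_attention(class_record))
```

In this invented class the median number of attempts meets a standard of 12, yet two pupils are still singled out for special attention.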

2.  DANGER  THAT  TESTS  WILL  BE  SO  ORGANIZED  THAT 
THE  ABILITIES  OF  PUPILS  WILL  NOT  BE  ACCU- 
RATELY REPRESENTED  BY  THE  STATEMENT  OF 
RESULTS 

Obviously  this  is  a  danger  which  must  be  of  chief  concern  to  those 
who  compare  achievement  tests  or  intelligence  tests.  Nevertheless  it  is 
of  such  importance  that  it  is  worth  while  to  give  some  attention  to  it  on 
this  occasion. 

To  illustrate,  I  select  the  Courtis  Standard  Tests  in  arithmetic. 
You  will  recall  that  each  problem  in  the  Courtis  Standard  Test  in  addi- 
tion consists  of  3  columns  of  figures,  with  9  figures  in  a  column. 
Let  me  recall  for  you  also  that  the  standard  of  achievement  based  upon 
these  tests  for  the  city  of  Boston  was  12  problems  attempted  with  10 
problems correctly solved, or an accuracy of 83.3 per cent.

The  various  possibilities  with  respect  to  accuracy  are  shown  in  the 
following  tabulation: 


Table of Illustrations

            Attempts    Rights    Per Cent of Accuracy
Standard    12          10        83.3%
            10          8         80.0%
            11          9         81.8%
            12          10        83.3%
            13          11        84.6%
            14          12        85.7%
            15          13        86.7%

Let us examine the actual facts with respect to the accuracy with
which pupils do their work in reaching the standard established. To
solve each one of the problems in addition a pupil must make 26
computations: 8 computations in the first column and 9 computations in each
of the two following columns, owing to the number carried. If he
attempts the 12 problems represented by the standard, he must therefore
make 312 computations (12 x 26). If the pupil made one error in each of 2
problems his accuracy is actually 310 computations out of 312, or about
99.4 per cent. The computed percentage of accuracy of
83.3 per cent, based on 12 problems attempted and 10 right, represents
the pupil's accuracy only in case he failed in every one of the 52
computations in the 2 examples which were marked incorrect.
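
The contrast just drawn can be checked with a little arithmetic. The
following Python sketch is offered as an illustration only and is not part
of the Courtis materials; the function names are hypothetical, and it
assumes, as the text does, 26 computations to each problem. It compares the
accuracy reported on the basis of problems with the accuracy reckoned on
the basis of single computations.

```python
# 8 computations in the first column, 9 in each of the next two (carried number).
COMPUTATIONS_PER_PROBLEM = 8 + 9 + 9


def problem_accuracy(attempts, rights):
    """Per cent of attempted problems marked right (the figure the test reports)."""
    return 100 * rights / attempts


def computation_accuracy(attempts, wrong_computations):
    """Per cent of single computations performed correctly."""
    total = attempts * COMPUTATIONS_PER_PROBLEM
    return 100 * (total - wrong_computations) / total


standard = problem_accuracy(12, 10)              # 12 attempted, 10 right
one_slip_each = computation_accuracy(12, 2)      # one error in each of 2 problems
all_wrong = computation_accuracy(12, 2 * COMPUTATIONS_PER_PROBLEM)  # all 52 wrong

print(f"{standard:.1f} {one_slip_each:.1f} {all_wrong:.1f}")
```

On these assumptions the reported 83.3 per cent agrees with the pupil's
accuracy in computation only in the extreme case where all 52 computations
in the two failed problems are wrong; a single slip in each of the two
problems gives about 99.4 per cent.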

3.     DANGER  THAT  TESTING  WILL  MISDIRECT  TEACHING 

If  testing  is  for  the  purpose  of  revealing  facts  to  the  teachers  to  the 
end  that  the  teachers'  instruction  may  be  improved,  then  educational 
measurement  is  of  primary  concern  to  the  teacher.  The  teacher  must 
be  expected  to  be  influenced  by  the  results  of  the  testing.  Teachers,  as  a 
rule,  undertake  to  meet  whatever  standards  are  established  for  their 
respective  classes.  Herein  lies  the  great  advantage  as  well  as  the  dis- 
advantage of  testing.  It  is  an  advantage  in  case  the  standards  are 
correct.  It  may  become  a  very  great  disadvantage  if  the  results  of  the 
testing  or  the  standards  misdirect  the  teachers.  A  few  observations 
will  make  this  point  clear. 

For  the  most  part,  educational  measurement  thus  far  has  had  to  do 
with  the  formal  aspects  of  subjects  tested.  Various  phases  of  arith- 
metic have  been  accurately  measured.  Standard  tests  in  geography  have 
had  to  do  largely  with  place  geography.  The  formal  side  of  English 
composition  has  been  measured,  but  less  success  has  been  obtained  in  the 
content  side  of  these  and  other  subjects. 

When  the  Department  of  Educational  Measurement  was  organized 
in  Boston  it  attempted  to  establish  lists  of  words  for  spelling  which 
were  most  difficult  and  which  every  pupil  should  know  how  to  spell.  At 
the  beginning  the  number  of  words  assigned  for  each  grade  was  com- 
paratively small.  At  the  end  of  each  year  pupils  were  tested  with  these 
words.  In  their  zeal  the  teachers  had  taught  these  words  so  thoroly 
that  pupils  spelled  them  with  a  high  degree  of  accuracy.  As  the  number 
of  words  for  each  grade  was  increased  teachers  were  unable  to  give 
such  intensive  training  in  each  word.  The  result  is  that  the  standard  of 
accuracy  established  for  the  words  originally  carried  in  the  lists  for  the 
various  grades  has  not  been  reached  in  later  years.  Undoubtedly  too 
much   time   and  attention  was  devoted   to   spelling   in   those  early  days. 

When  the  Harvard-Newton  scales  for  the  measurement  of  English 
composition  were  being  prepared  it  was  the  intention  to  provide  a  scale 
for  each  of  the  five  forms  of  English  composition,  namely:  description, 
narration,  exposition,  argumentation,  and  reproduction.  After  careful 
consideration  those  preparing  the  composition  scales  concluded  to  omit 
reproduction  since  it  is  a  form  of  composition  that  should  probably  not 
be  further  emphasized  in  our  schools.  It  was  felt  that  reproduction 
is  too  frequently  resorted  to  by  the  teachers,  that  it  is  not  the  best  form 
of  training.  Inevitably  teachers  would  feel  that  it  was  equally  impor- 
tant with  other  forms  of  composition  should  a  measuring  scale  be  pro- 
vided for  it,  hence  this  composition  scale  was  not  printed  for  distribu- 
tion. 

There is another aspect of the measuring of English composition
which I desire to allude to. The most usual form of composition scale
is one which contains a series of compositions by which the compositions
of children of elementary school and high school shall be measured. I
desire to record my opinion that poor compositions in grade eight, that
for example may be rated 60 per cent, are not fair measures of the work
done in the grades preceding grade eight. The composition set up as
the best composition in a scale should be an ideal towards which the
pupils may work as well as the measure of the quality of their efforts.
A 60 per cent composition written by an eighth grade pupil cannot
possibly be held up as an ideal toward which children in the lower grades
may work.


[Graph: daily test results of a class, plotted by date; the axis labels include June and July.]

4.  DANGER THAT THE RESULTS OF TESTING WILL BE
GIVEN TOO GREAT WEIGHT

In  view  of  my  address  on  Friday  afternoon,  I  feel  that  there  is  no 
danger  of  my  being  misunderstood  when  I  say  that  the  results  of  testing 
work  may  be  given  too  great  weight.  I  consider  those  results  invaluable; 
however,  while  the  results  of  testing  constitute  important  data  they  are 
not  exclusive  data.  Scholastic  marks  are  still  of  importance.  Pupils' 
interests  and  attitudes  must  likewise  be  taken  into  consideration  along 
with  the  results  of  testing,  if  school  officials  and  teachers  are  to  exercise 
wise  direction  of  a  pupil's  education. 

Intelligence  tests  and  accomplishment  tests  must  supplement  one 
another.  The  results  of  these  two  tests,  together  with  the  knowledge 
of  the  pupil  and  his  ability  which  teachers  and  officers  have  heretofore 
possessed,  will  form  a  comprehensive  basis  on  which  to  plan  a  pupil's 
education. 

There  is  danger  that  the  results  of  the  testing  may  be  given  too 
great  weight  if  only  one  test  of  a  pupil  or  a  class  is  used  as  the  basis 
of  judgment.  As  an  illustration  I  insert  here  a  graph  which  shows  the 
progress  of  a  class  exercised  with  drills  and  tested  daily  over  a  period 
of  thirty  school  days. 

The  graph  indicates  that  the  progress  of  the  class  is  not  uniformly 
upward,  but  that,  on  the  contrary,  on  some  days  the  class  as  a  whole 
does  better  than  it  does  on  other  days.  Its  progress  in  general  is 
upward.  Care  must  be  taken  in  the  use  of  such  results,  because  when 
the  class  is  tested  it  may  be  that  the  class  is  (a)  on  the  mountain  peak, 
(b)  in  the  valley,  or  (c)  on  a  plateau.  The  results  ought  to  be  consid- 
ered accordingly. 

CONCLUSIONS 

I  have  indicated  four  possible  dangers  in  educational  measurement. 
In  conclusion  I  desire  to  indicate  how  those  dangers  can  be  avoided. 

1.  The  danger  of  a  misunderstanding  of  testing  on  the  part  of  the 
teachers  can  be  avoided  by  the  instruction  and  training  which  is  now  a 
part  of  the  preparation  of  future  teachers.  Teachers  now  in  the  service 
can  be  provided  instruction  by  their  officers  and  can  receive  invaluable 
instruction  thru  conferences  like  this. 

2.  The  danger  that  the  results  from  a  test  will  not  represent  the 
abilities of the pupils can be avoided by the exercising of more care in
the  preparation  of  tests.  This  is  a  danger  which  the  comparatively  few 
people  preparing  tests  must  undertake  to  avoid. 

3.  The danger that the tests will misdirect teaching can be avoided
thru the extension of testing work to cover the whole field of instruction;
by the exercising of care in organizing and carrying on testing
work so that the attention of teachers will be spread over a large field
of instruction; and finally thru teachers trained in the value and
significance of educational measurement.

4.  The  danger  that  the  tests  will  be  given  too  great  weight  will  be 
avoided  by  a  better  understanding  on  the  part  of  teachers  and  officers 
of  the  place  of  such  testing  in  a  general  program  of  educational  better- 
ment. 


Education and Our Responsibility for Its Improvement


Frank W. Ballou, Superintendent of Schools, Washington, D.C.


Education  in  the  United  States  today  faces  one  of  the  most  serious 
crises  in  its  history.  The  war  revealed  the  inadequacy  of  some  parts  of 
our  educational  program.  The  war  likewise  created  conditions  in  our 
school  system  which  it  will  take  at  least  one  generation,  and  perhaps 
many  generations,  to  eradicate.  An  enlarged  and  a  more  intensive 
educational  program  is  a  national  necessity.  And  yet  the  financial  re- 
sources provided  for  public  education  are  not  sufficient  to  do  more  than 
to  maintain  the  educational  standards  existing  before  the  war.  The  pol- 
icy of  financial  retrenchment  in  state  and  nation  threatens  to  cheapen 
the  education  of  boys  and  girls  now  in  school. 

REVELATIONS  AND   RESULTS   OF   THE  WAR 

The  financial  burdens  resulting  from  the  war  are  large.  However 
great  our  financial  burdens  may  now  be,  they  should  not  be  allowed  to 
blind  us  to  the  necessity  of  placing  adequate  financial  resources  at  the 
disposal  of  the  state  for  the  appropriate  education  of  the  youth  of 
America  now  found  in  our  elementary  schools,  secondary  schools,  colleges, 
and  universities.  As  far  as  possible,  the  burden  of  war  debts  should  be 
borne  by  the  present  generation.  Financial  obligations  incurred  by  the 
present generation should not be passed on to future generations. There
is grave danger that the financial burdens of the war will be
visited on the pupils now in our public schools thru reduced educational
opportunities provided for them.

For  the  most  part,  the  young  men  inducted  into  the  American  Ex- 
peditionary Forces  were  the  product  of  the  public  school  system.  No 
one  can  successfully  assail  their  courage,  pluck,  alertness  of  mind,  and 
other  personal  qualities  which  go  to  make  efficient  citizens  in  times  of 
peace  or  of  war.  They  met  every  test  of  good  soldiers,  as  they  had 
previously  given  promise  of  meeting  every  requirement  of  good  citizens. 
The  nation  owes  a  debt  of  profound  gratitude  which  is  bound  to  increase 
as  the  years  go  by,  not  only  to  those  who  sleep  in  the  poppy  fields  of 
France,  but  also  to  those  who  have  returned  home  to  assume  the  re- 
sponsibilities of  citizenship  in  popular  government. 

The  less  favorable  conditions  among  our  youthful  citizens  revealed 
by  the  war  must  likewise  be  faced  and  cannot  be  lightly  passed  by. 
While  we  extol  the  qualities  of  those  who  served  the  nation,  we  must 
also  give  attention  to  those  young  men  of  draft  age  who  were  rejected 
or  put  into  limited  service.  The  army  educational  tests  showed  that 
29.4  per  cent  of  the  men  included  in  the  first  draft  were  unable  to  pass 
successfully a simple test of intelligent reading and legible writing.
The  draft  of  June,  1917,  showed  nearly  a  hundred  thousand  illiterates 
or  those  who  were  approximately  illiterates.  The  majority  of  these 
illiterates  were  native-born.  They  were  unable  to  serve  their  country 
in  time  of  war  because  of  their  illiteracy. 

Moreover,  the  records  show  that  of  the  3,208,000  men  included  in 
the  first  draft  and  physically  examined,  949,000  or  29.6  per  cent  were 
totally  or  partially  disqualified  for  military  duty  for  physical  reasons. 
In  addition,  the  last  Federal  census  shows  that  there  were  4,931,905 
illiterates  in  the  United   States.     Of  these,  3,084,748  were  native-born. 
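
The percentages cited here can be verified directly from the counts. The
short Python sketch below is offered only as a check on the arithmetic; the
helper name per_cent is hypothetical, and the native-born share of
illiterates is a derived figure which the records quoted do not themselves
state.

```python
def per_cent(part, whole):
    """Share of the whole, as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Men of the first draft disqualified, totally or partially, for physical reasons.
disqualified = per_cent(949_000, 3_208_000)

# Native-born persons among the illiterates counted by the Federal census.
native_born = per_cent(3_084_748, 4_931_905)

print(disqualified, native_born)
```

The first figure agrees with the 29.6 per cent given above; the second
shows that the native-born made up well over half of the illiterates
counted.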

INADEQUACY NOT INCOMPETENCY OF EDUCATION

These figures on illiteracy among our native-born young men are
not so much evidence of failure to do well what the school system
undertook to do as of failure to make the educational system as
comprehensive as it ought to be. These facts reveal the incompleteness of
public education rather than its incompetency.

When  one  reviews  the  conditions  in  the  educational  system  of 
America,  the  revelations  of  the  war  do  not  appear  startling.  On  the 
contrary,  they  are  the  logical  results  of  conditions  existing  in  our  sys- 
tem of  education.  The  following  figures  represent  conditions  which 
contribute  to  the  ineffectiveness  of  American  education: 

Of  the  600,000  teachers  in  American  elementary  and  secondary 
schools,  30,000  have  only  an  eighth-grade  education  or  less.  One-fourth 
of  the  600,000  teachers  have  had  only  two  years  of  high  school  work  or 
less.  Only  one-fifth  of  the  600,000  teachers  possess  the  standard  of 
professional  preparation  recognized  as  a  minimum  by  nations  generally. 
One-fourth  of  these  600,000  teachers  have  had  less  than  two  years  of 
experience  and  one-half  of  them  have  had  less  than  four  years  of  ex- 
perience. 

The  typical  rural  school  teacher  in  America  has  had  less  than  two 
years  of  high  school  work,  is  under  twenty-one  years  of  age,  has  had 
less  than  two  years  of  experience  in  teaching  and  is  at  least  twenty 
miles  from  a  superintendent  or  supervisor  who  visits  her  only  once 
a  year. 

The  inadequacy  of  American  education  is  to  be  accounted  for,  in  a 
large  measure,  by  the  conditions  indicated.  These  conditions  contributed 
to  or  permitted  the  educational  defects  among  the  men  drafted  for  the 
Great  War.  These  conditions  and  others  similar  to  them  made  possible 
the  near  illiteracy  and  physical  defects  revealed  in  the  drafts.  No  im- 
provement can  be  made  in  overcoming  illiteracy  or  in  improving  the 
physical  condition  of  our  boys  and  girls  until  the  conditions  which  made 
possible  the  revelations  of  the  draft  shall  have  been  improved. 

EFFECTS OF THE WAR ON SCHOOL SYSTEM

The school as a social institution would naturally not escape from
participation in the war activities or certain effects brought about as a
result of the great conflict. Many teachers and school officers were
called into service directly connected with the war. Teachers and officers
who remained at their school posts devoted themselves not only to
their school work but also to many and varied war activities outside of
the classroom. Moreover, pupils were called upon to participate in the
great variety of war activities during the time ordinarily devoted
to more direct educational work. The school system as a whole
contributed in many ways, directly and indirectly, to the prosecution of
the war.

While  the  school  system  is  apparently  gradually  returning  to  normal 
conditions  and  is  devoting  itself  to  its  pre-war  educational  progress, 
there  is  one  outstanding  result  of  the  war  which  is  likely  to  be  over- 
looked. Most  school  systems  replaced  the  teachers  and  officers  entering 
the  war  service  with  persons  far  below  the  professional  and  educational 
qualifications  of  those  who  left.  Thus  far  the  supply  of  competently 
trained  teachers  has  not  been  sufficiently  large  to  replace  those  persons 
in  the  school  system  whose  professional  qualifications  and  teaching  skill 
are not up to the American standard. Even if salary conditions which
prevailed  preceding  the  war  are  re-established  it  will  take  at  least  a 
generation  to  re-establish  the  professional  standing  of  the  teaching  pro- 
fession in  America.  If  the  salaries  are  not  raised  to  a  level  which  will 
represent  a  purchasing  power  equal  to  the  purchasing  power  of  teachers' 
salaries  before  the  war,  the  professional  standards  of  the  teaching  pro- 
fession must  continue  to  be  lower  for  more  than  a  generation. 

The war brought all schoolhouse construction to a standstill. Neither
labor nor building material was available for the carrying on of the
usual  amount  of  building  to  meet  the  needs  of  our  growing  systems  of 
public  education.  As  a  result,  the  school  buildings  of  the  nation  are 
wholly  inadequate  to  house  satisfactorily  the  pupils  who  are  annually 
seeking  education  in  elementary  schools,  high  schools,  colleges,  and  uni- 
versities. 

THE  PRESENT  SITUATION 

During  the  war  it  took  the  undivided  attention  of  educators  gen- 
erally to  keep  the  schools  running  at  all.  The  shortage  of  teachers  was 
universal.  The  lack  of  school  buildings  in  which  to  house  the  pupils 
gradually  became  acute.  Other  administrative  difficulties  have  so  com- 
pletely occupied  the  time  of  the  school  men  and  women  that  little  or  no 
thought  has  been  devoted  to  educational  aims,  methods  of  teaching,  and 
educational  results.  The  discussion  of  educational  principles  is  almost 
completely  left  out  of  many  of  our  educational  meetings,  and  discussion 
of  the  schoolhouses,  the  school  budget,  and  the  shortage  of  teachers 
has  taken  its  place. 

While  better  buildings  make  better  education  a  possibility,  it  takes 
more  than  a  good  school  building  to  make  an  educational  institution. 
The  longer  adequate  appropriations  for  buildings  are  delayed,  the  longer 
will  the  discussion  of  educational  principles  be  postponed,  because  the 
latter  must  await  the  former.  And  better  educational  results  must 
likewise  await  both. 

Our  elementary  schools  thruout  the  cities  of  the  country  are  greatly 
overcrowded.  Part-time  prevails  in  astonishing  proportions.  One-room 
portable schools are being used instead of well-equipped permanent
buildings.  Classes  have  been  increased  in  size  beyond  reason.  The 
educational  program  has  been  curtailed  in  many  ways.  The  so-called 
Gary  plan  has  been  tried  as  a  means  of  accommodating  a  larger  number 
of  pupils  than  the  buildings  were  constructed  to  accommodate.  Other 
plans  of  various  kinds  which  make  our  schools  into  factories  or  mills 
instead  of  efficient  educational  institutions  have  been  put  into  operation 
in  an  attempt  to  educate  the  masses  of  youth  of  the  present  generation. 

Larger  and  larger  numbers  of  our  boys  and  girls  are  going  to  high 
school.  This  should  be  our  glory,  but  it  is  in  reality  an  embarrassment. 
High  schools  have  been  forced  to  accommodate  pupils  beyond  their  capaci- 
ties. Various  kinds  of  double-shift  programs  have  been  devised  to  make  at 
least  the  attendance  of  pupils  a  possibility.  Some  day  when  these  same 
boys  and  girls  shall  have  become  citizens  I  fear  they  will  be  discussing 
and  condemning  the  conditions  which  we  permitted  to  stand  between 
them  and  an  adequate  education. 

Colleges  and  universities  are  likewise  crowded.  There  is  a  marked 
difference  between  the  policy  followed  in  the  public  schools  and  the 
policy  followed  in  the  colleges  and  universities.  The  public  school  offi- 
cials must  make  room  for  all  pupils  who  apply.  On  the  contrary,  when 
the  enrollment  of  a  college  or  university  reaches  the  capacity  of  the  in- 
stitution, frequently  the  doors  are  closed  and  admission  denied. 

When  I  was  associated  with  the  University  of  Cincinnati  more  than 
ten  years  ago,  most  colleges  and  universities  were  seeking  students. 
They  were  competing  with  one  another  to  increase  their  enrollments. 
Today  the  situation  is  the  reverse.  For  some  time,  many  colleges  for 
women  have  limited  their  enrollment. 

Recently  there  has  developed  much  public  discussion  which  is  the 
direct  outgrowth  of  the  large  enrollment  in  our  colleges  and  universi- 
ties. Instead  of  undertaking  to  provide  suitable  collegiate  and  univer- 
sity training  for  the  men  and  women  who  are  seeking  it,  the  present 
agitation  has  to  do  with  reasons  why  men  and  women  should  not  go  to 
college. 

The  present  discussion  centers  around  the  statement  of  President 
Ernest  M.  Hopkins  of  Dartmouth,  on  September  21,  when  he  made  the 
general  assertion  that  "Too  many  men  are  going  to  college."  On  that 
occasion  President  Hopkins  is  reported  to  have  said: 

"The  opportunities  for  securing  an  education  by  way  of  the  college 
course  are  definitely  a  privilege  and  not  at  all  a  universal  right.  The 
funds  available  for  appropriation  to  the  uses  of  institutions  of  higher 
learning  are  not  limitless  and  cannot  be  made  so  whether  their  origin 
be  sought  in  the  resources  of  public  taxation  or  in  the  securable  bene- 
factions for  the  enhancing  of  private  endowments.  It  consequently  be- 
comes essential  that  a  working  theory  be  sought  that  will  operate  with 
some  degree  of  accuracy  to  define  the  individuals  who  shall  make  up  the 
group  to  whom,  in  justice  to  the  public  good,  the  privilege  shall  be  ex- 
tended, and  to  specify  those  from  whom  the  privilege  should  be  with- 
held." 

Following  the  publication  of  those  views  of  President  Hopkins,  Chan- 
cellor E.  E.  Brown  of  New  York  University  has  said: 

"There are too many college students only if their mere numbers
prevent us from giving a fit training to the destined leaders of men within
the  next  generation,  but  we  cannot  admit  this  without  confessing  a  certain 
bankruptcy  of  inventive  resourcefulness,  a  limitation  which,  as  thinking 
men,  we  cannot  accept." 

President Sidney Mezes of the College of the City of New York
says:

"If  President  Hopkins  means  to  say  that  the  country  is  getting  too 
many  college-trained  men,  I  must  disagree  with  him.  The  country  can- 
not have  too  many  men  who  are  well-trained.  But  I  think  he  was  talk- 
ing of  those  persons  who  are  actually  in  college  who  ought  not  to  be 
there." 

I  do  not  desire  to  enter  into  a  discussion  of  the  merits  of  the  prop- 
osition advanced  by  President  Hopkins.  I  cite  it  only  as  an  illustration 
of  what  I  believe  to  be  an  unworthy  attempt  to  justify  limiting  the  num- 
ber of  young  men  and  women  going  to  college.  I  believe  that  the  time 
will  come  when  every  college  and  university  will  feel  as  complete  a  re- 
sponsibility for  enrolling  all  young  men  and  women  who  desire  to  pursue 
collegiate  education  as  does  the  public  school  system  at  the  present  time. 
Collegiate  education  should  be  as  universally  open  to  young  men  and 
women  as  is  high  school  education.  Moreover,  I  believe  that  the  time  will 
come  when  colleges  and  universities  will  conduct  a  careful  self-examina- 
tion to  ascertain  whether  the  program  of  education  provided  in  our  col- 
leges is  as  diversified  as  are  the  social  needs  for  which  colleges  should 
train  men  and  women,  and  as  varied  as  are  the  capacities,  interests,  and 
future  careers  of  the  young  men  and  women  who  enter  those  institu- 
tions. I  believe  that  such  a  self-examination  will  reveal  that  many 
young  men  and  women  are  considered  unsuccessful  in  their  college  work 
only  because  they  do  not  fit  into  a  narrowly  conceived  collegiate  pro- 
gram, rather  than  because  they  do  not  possess  the  ability  to  pursue 
proper  collegiate  training  which  will  fit  them  for  their  future  careers 
as  useful  citizens. 

ENLARGED  EDUCATIONAL  PROGRAM  A  NECESSITY 

The  revelations  of  the  war  and  the  effects  of  the  war  have  made 
increased  demands  on  the  schools  for  education  of  a  better  quality  and 
for  an  educational  program  broader  in  scope.  Increased  attention  has 
been  given  to  the  problem  of  Americanization  of  foreigners.  Renewed 
interest  has  been  exhibited  in  the  passage  and  enforcement  of  compul- 
sory education  laws  to  the  end  that  native-born  boys  and  girls  shall  not 
grow  up  illiterate.  There  is  a  tendency  toward  increasing  the  length 
of  the  school  year.  Summer  schools  and  evening  schools  are  being  ex- 
tended to  provide  additional  educational  facilities. 

Increased  attention  is  likewise  being  given  to  physical  education. 
Medical  examination  of  pupils  is  a  part  of  the  educational  program  to 
the  end  that  the  physical  defects  of  the  boys  and  girls  may 
be  revealed  and  corrected  during  the  progress  of  pupils  thru  the  schools. 
Legislation  has  been  passed  in  several  states  prescribing  physical  edu- 
cation as  a  part  of  the  school  program. 


EXPENDITURES VS. POLICY OF ECONOMY

These  enlarged  educational  opportunities  and  these  extensions  of  the 
educational  program  involve  larger  expenditures  of  money.  This  de- 
mand for  increased  expenditures  for  public  education  appears  to  fly  in 
the  face  of  a  public  demand  for  lowering  the  cost  of  government  gen- 
erally. To  contribute  to  the  solution  of  the  problem  a  comprehensive 
national  investigation  of  financing  public  education  is  under  way.  It  is 
the  purpose  of  this  inquiry  to  reveal,  if  possible,  additional  sources  of 
revenue  for  public  education.  Efforts  have  likewise  been  made  to  se- 
cure financial  support  from  the  national  government  to  the  end  that 
states  may  be  assisted  in  establishing  a  larger  and  more  comprehensive 
program  of  public  education. 

In  the  last  analysis  any  educational  program  must  be  reduced  to  a 
money  basis.  Money  alone  will  not  make  a  school  system  efficient,  but  a 
school  system  cannot  be  made  efficient  without  adequate  appropriations. 
Whether  a  system  of  public  education  of  the  future  shall  produce  better 
results  than  the  school  system  of  the  past  depends  more  largely  upon 
appropriations  for  public  education  than  upon  any  other  single  factor. 

In  an  address  before  the  Department  of  Superintendence  of  the 
National  Education  Association,  in  Chicago,  in  1922,  Mr.  Will  C.  Wood, 
state  superintendent  of  public  instruction  of  California,  undertook  to 
answer  the  question,  "What  has  this  country  reason  to  expect  by  way  of 
culture,  efficiency,  and  good  citizenship  in  return  for  the  cost  of  operat- 
ing public  education?"  I  quote  two  paragraphs  from  Superintendent 
Wood's  address: 

"The  natural  point  of  departure  in  this  discussion  is  the  amount  we 
are  now  expending  for  the  elementary  and  secondary  schools  of  our 
country.  The  figures  for  the  year  ended  June  30,  1920,  show  a  total 
expenditure  for  the  public  schools  by  all  states  of  a  trifle  less  than 
$900,000,000.  With  the  growth  of  population  and  the  upward  trend  of 
wages  that  persisted  in  the  early  part  of  the  last  fiscal  year,  we  may 
safely assume that the total expenditure for public education during
the year ended June 30, 1921, was between $1,000,000,000 and
$1,100,000,000, which is approximately $10 for each man, woman, and child
in the country. Let us compare this expenditure for that year with the
total  expenditure  of  1911,  a  decade  previous.  The  total  expendi- 
ture during  that  year  was  $446,727,000,  or  $4.76  for  each  unit 
of population. The federal Department of Labor is authority
for the statement that during the year 1921 the purchasing
power of a dollar was less than half its purchasing power
in 1911. In other words, the $10 per capita expended for public
education  in  1921  could  purchase  no  more  than  the  $4.76  per  capita 
expended  in  1911.  The  so-called  increases  in  school  expenditures  during 
the  last  decade  have  therefore  not  been  increases  at  all.  School  expendi- 
tures measured  in  purchasing-power  are  no  greater  today  than  they 
were  before  the  war.  After  all  the  agitation  and  enthusiasm  for  educa- 
tion growing  out  of  awakened  recognition  of  the  nation's  dependence 
upon  and  duty  toward  the  youth  it  had  drafted  for  its  defense,  we  find 
that expenditure for schools has barely kept up to the pre-war level.
The  country  as  a  whole  is  spending  for  education  in  1922  in  less  pro- 
portion of  its  total  income  than  in  1911.  It  must  therefore  be  observed 
that  the  schools  are  guiltless  of  the  greed  of  the  profiteer. 

"What  in  the  way  of  culture,  efficiency,  and  good  citizenship  has 
this  country  reason  to  expect  in  1922  in  return  for  the  cost  of  operating 
the  public  schools?  The  answer  is  simple  and  direct:  inasmuch  as  it 
is  spending  no  more  of  purchasing  power  upon  the  schools  in  1922  than 
it  was  expending  in  1911,  it  has  reason  to  expect  no  more  by  way  of 
culture,  efficiency,  and  good  citizenship  than  it  secured  by  the  expendi- 
tures of  1911.  It  has  reason  to  expect  no  more,  but  as  a  matter  of  fact 
it  is  getting  more,  the  increase  being  chargeable  to  the  devotion  of  the 
great  body  of  teachers  and  school  administrators  to  the  cause  of  better 
America." 
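
Superintendent Wood's comparison of the two years can be worked thru as
follows. This Python sketch is illustrative only; the function name is
hypothetical, and since the address says merely that the dollar of 1921
bought "less than half" what it bought in 1911, the ratio of 0.5 is assumed
here as an upper bound rather than taken as a reported figure.

```python
PER_CAPITA_1911 = 4.76    # dollars per person, as quoted in the address
PER_CAPITA_1921 = 10.00   # approximate dollars per person, as quoted
RATIO_UPPER_BOUND = 0.5   # assumption: "less than half" taken as at most 0.5


def in_1911_dollars(amount_1921, purchasing_power_ratio):
    """Express a 1921 amount in terms of 1911 purchasing power."""
    return amount_1921 * purchasing_power_ratio


# Upper bound on 1921 spending per person, measured in 1911 dollars.
real_1921 = in_1911_dollars(PER_CAPITA_1921, RATIO_UPPER_BOUND)

# Ratio at which 1921 spending would exactly equal 1911 spending in real terms.
break_even_ratio = PER_CAPITA_1911 / PER_CAPITA_1921

print(f"{real_1921:.2f} {break_even_ratio:.3f}")
```

At the assumed bound the $10 of 1921 is worth at most $5.00 in the dollars
of 1911, scarcely above the $4.76 then spent; at any ratio below 0.476 the
real expenditure per person would have declined.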

I  am  not  an  alarmist.  I  do  not  approve  of  exaggeration  as  a  means 
of  emphasis.  I  believe  in  the  fundamental  soundness  of  our  program 
of American education. Nevertheless, the educational program of the
pre-war period, and still less the educational program made obviously
necessary by the revelations of the war, cannot be financed with the same or
only  slightly  increased  appropriations,  when  the  dollar  is  buying  less 
than  one-half  as  much  education  as  it  purchased  before  the  war. 

OUR   RESPONSIBILITY 

School  men  and  women  in  elementary  schools,  high  schools,  colleges, 
and  universities  must  assume  a  larger  leadership  in  their  respective 
communities,  in  their  states,  and  in  the  nation.  The  teaching  profes- 
sion has  never  begun  to  realize  the  possibilities  which  are  bound  up  in 
its  members,  in  its  prestige  in  state  and  nation,  and  in  its  capacity  for 
public  service.  Thru  the  National  Education  Association  with  its 
100,000  members,  thru  the  various  state  associations  affiliated  therewith, 
and  thru  the  still  larger  number  of  city  and  community  associations, 
the  teaching  profession  has  an  unprecedented  opportunity  for  the  exer- 
cise of  constructive  educational  leadership  in  community,  state,  and 
nation. 

We  must  always  remember  that  at  any  time  we  are  only  one  genera- 
tion removed  from  a  generation  of  ignorant  people.  Close  educational 
institutions  for  a  brief  period  of  one  generation  and  the  next  generation 
must  be  illiterate.  Each  generation  must  provide  the  educational  facili- 
ties necessary  to  prepare  the  next  generation  of  men  and  women  to  sus- 
tain and  improve  our  democratic  institutions.  Whenever  a  democratic 
nation  is  financially  unable  to  provide  an  adequate  education  for  its 
future  citizens,  it  is  not  only  financially  bankrupt  but  it  will  soon  be- 
come bankrupt  educationally,  politically,  and  morally. 

Let  us,  as  members  of  the  great  profession  of  teaching,  consecrate 
ourselves  to  the  task  of  creating  an  intelligent  public  opinion  concerning 
the importance of education in a democracy, to the end that the generation
of young people now in our schools, colleges, and universities shall
not be deprived of an adequate education, which has gradually been
established as the birthright of every boy and girl in America.



Selling  Educational   Research  to  a  Large  City 

School  System 


Robert  H.  Lane,  Assistant  Superintendent  of  Schools,  Los  Angeles,  Calif. 


I  think  I  should  say  by  way  of  preface  that  I  have  come  before 
you  under  false  pretenses.  I  am  not  a  research  man  either  by  training  or 
experience — I  am  simply  a  salesman,  comparable  to  those  knights  of  the 
road  who  sell  you  anything  from  bonds  to  battleships,  or  possibly  more 
nearly  akin  to  their  humbler  brothers  who  peddle  soap  from  door  to 
door  or  who  lie  in  wait  for  you  at  the  office  with  the  world's  best 
literature  complete  in  thirty  volumes,  a  monumental  work  which  no  in- 
telligent man  could  afford  to  be  without. 

I  warn  you  also,  that  this  is  not  a  formal  address  upon  educational 
research.  It  is  merely  the  plain,  unvarnished  tale  of  how  two  men  at- 
tempted single-handed  (an  expression  worthy  of  Sir  Boyle  Roche,  by 
the  way)  to  sell  educational  research  to  a  large  city  school  system  which 
had  gotten  along  without  it,  thank  you,  for  a  goodly  number  of  years. 
Such  a  form  of  exposition  has  at  once  its  virtues  and  its  faults — you 
will  have  to  pardon  a  certain  diffuseness,  a  wearisome  repetition,  and  a 
multiplicity  of  personal  pronouns.  On  the  other  hand,  you  will  gain 
by  hearing  a  recital  of  actual  occurrences,  vivid,  concrete,  and  I  hope 
interesting  and  helpful  to  all  of  you  who,  in  one  form  or  another,  are 
selling  research  to  individuals  or  to  groups  of  persons  interested  in 
education. 

On  the  first  day  of  May,  1917,  Dr.  Arthur  Sutherland  and  myself 
were  transferred  to  the  superintendent's  office,  one  of  us  as  director  of 
the  division  of  psychology  and  the  other  as  director  of  educational  re- 
search. Dr.  Sutherland's  instructions  were  brief  and  definite — he  was 
to  locate  the  feeble-minded  children  in  the  city  schools,  segregate  them 
into  groups,  and  provide  for  them  instruction  by  competent  teachers. 
My  instructions  were  equally  brief  but  hardly  as  definite.  The  director 
of  educational  research  was  to  ascertain  periodically  if  the  elementary 
pupils  in  our  city  schools  were  able  to  compete  on  equal  terms  with 
pupils  of  other  cities  in  the  matter  of  educational  group  tests  standard- 
ized at  that  time.  If  our  pupils  were  able  to  present  a  creditable  show- 
ing, well  and  good;  if  not,  the  director  was  to  devise  methods  and  de- 
vices whereby  the  deficiencies  could  be  most  quickly  overcome.  You  will 
notice  at  once  the  significant  fact  that  the  whole  purpose  of  research 
was  held  to  be  administrative.  It  appeared  to  be  simply  good  policy  and 
good  advertising  to  have  the  city  schools  as  a  whole  make  as  good  a 
record  in  the  educational  world  as  the  schools  of  other  systems.  That 
educational  research  could  be  intimately  connected  up  with  the  needs  of 
individual  pupils,  that  it  could  be  a  mighty  lever  to  move  the  dead- 
weight of  maladjusted  and  retarded  pupils,  was  apparently  not  even 
dreamed  of. 


It  seemed  to  both  of  us  as  we  planned  our  work  that  we  could  fol- 
low one  of  two  possible  courses:  (a)  We  could  give  thru  the  machinery 
of  the  superintendent's  office  a  series  of  city-wide  group  tests,  using  the 
kind  offices  of  volunteer  workers  from  the  local  universities  and  high 
schools  in  scoring  and  tabulating  the  results  of  the  tests.  Such  a  plan 
would  bring  immediate  returns  in  the  shape  of  records  covering  all  of 
our  schools  with  the  expenditure  of  comparatively  little  time.  From  the 
administrative  standpoint  this  was  the  ideal  course  of  action.  It  had 
the  fatal  defect  of  touching  the  immediate  needs  and  interests  and  daily 
tasks  of  the  classroom  teacher  not  at  all,  except  in  a  most  superficial 
way. Or, (b), we could work individually with principals and teachers,
giving  our  tests  in  person,  and  scoring  them  and  interpreting  them  so 
as  to  make  constructive  diagnosis  possible.  We  realized  from  the  first 
that  such  a  plan  would  be  painfully  slow  and  that  we  would  have  little 
to  show  in  the  way  of  results  for  a  long  time  to  come,  but  since  we 
were  building  for  the  future,  since  we  had  all  the  years  ahead  of  us  to 
work  in,  and  above  all,  since  we  had  faith  in  the  willingness  and  ability 
of  our  people  to  grasp  this  new,  strange  device  in  education,  once  it  was 
fairly  presented  to  them,  we  decided  upon  a  salesmanship  campaign 
whereby  educational  research  should  be  "sold"  in  person  to  the  prin- 
cipals and  teachers  of  our  schools.  May  I  point  out  to  you,  therefore, 
that  by  and  large  our  sales  campaign  passed  thru  four  distinct  phases, 
the  first  of  which  I  shall  call 

I. The House-to-House Canvassing Phase. "Heteme aqui ya fuera
de Oviedo, camino de Penaflor," says Gil Blas at the beginning of his
travels. "Behold me therefore just outside Oviedo, on the way to Penaflor."
Behold the speaker, therefore, one bright day in May armed
with  a  brief  case  packed  with  the  popular  tests  of  that  day  descend  upon 
a  large  elementary  school  whose  principal  desired  to  know  what  this 
new  educational  device  might  be  and  how  it  worked.  By  chance  I  found 
a  school  composed  largely  of  Jewish  children  who  were  intensely  in- 
terested in  arithmetic  and  who  responded  to  the  Courtis  Tests  in  Arith- 
metic with  great  enthusiasm.  The  tests  were  given  in  person  and 
teachers  and  principal  invited  to  watch  the  proceedings.  The  papers 
were  scored  and  the  results  tabulated  on  the  spot,  and  some  attempt 
was  made  to  interpret  the  variations  in  class  and  individual  ability 
which  were  plain  to  behold.  The  next  day,  another  school  was  visited 
where  reading  was  popular  and  the  then  Kansas  Silent  Reading  Tests 
proved  the  entering  wedge.  For  several  weeks  the  process  was  repeated 
until  a  growing  interest  in  our  tests  led  to  several  invitations  to  visit 
schools  whose  principals  and  teachers  had  not  been  conspicuously 
friendly  to  the  test  idea  at  the  inception  of  the  campaign  but  who  had 
later  become  interested  from  reports  emanating  from  the  schools  origi- 
nally visited.  It  should  be  noted  that  the  salesman  evolved  thru  experi- 
ment a  daily  schedule  which  proved  fairly  satisfactory.  Each  morning 
a  school  was  visited,  tests  given  as  far  thru  the  classes  as  time  would 
permit,  and  the  afternoon  of  the  same  day  was  devoted  to  marking 
papers,  tabulating  results,  and  to  preparing  a  crude  report  which  was 
later submitted in person to the principal and his teachers so as to establish
a basis for profitable discussion. By the end of the term, about
one-half  of  our  schools  had  been  visited,  and  while  the  work  in  any  one 
school  was  painfully  superficial  and  inadequate,  the  idea  was  beginning 
to  "sell"  and  there  was  a  growing  demand  for  our  "goods."  Occasion- 
ally mimeographed  reports  of  combined  results  were  prepared  showing 
that  in  the  main  our  elementary  pupils  were  conspicuously  behind  in 
some subjects compared with children elsewhere while presenting a
favorable result in other subjects.

The  steady  increase  in  our  school  population  at  this  time  and  the 
consequent  necessity  for  hastening  our  progress  thru  the  schools  led  to 
the  second  phase  of  our  campaign  which  I  shall  call 

II.  The  Instructional  Classes  Phase.  By  way  of  experiment,  we 
inserted  in  our  local  school  journal  a  modest  notice  to  the  effect  that  we 
would,  at  the  opening  of  the  next  school  term,  inaugurate  a  class  in 
educational  measurement  for  elementary  school  principals  who  desired 
to know enough of this new idea in education to experiment in their
schools. We fixed the membership of the class at twenty students, not
because we wished really to restrict the size of the class but because we
had  no  idea  how  many  replies  would  be  received  in  answer  to  our  in- 
vitation, if  any  at  all.  In  other  words  we  acted  on  the  well-known  prin- 
ciple laid  down  by  Charles  Dickens  that  one  of  the  best  ways  to  get  people 
to  attend  a  performance  is  to  impress  upon  them  that  it  is  impossible  to 
get  in.  To  our  surprise  we  enrolled  twenty-four  principals  in  this  first 
venture  of  ours  and  carried  them  thru  a  five  months'  course,  meeting 
once  a  week  after  school.  The  method  of  instruction  was  simplicity  itself 
— our  principals  were  told  to  consider  themselves  elementary  pupils  and 
tests  were  given  to  them  just  as  we  gave  them  to  our  boys  and  girls.  Then 
the  papers  were  marked  and  the  results  tabulated  on  the  blackboard  and 
individual  difficulties  explained  away.  Diagnostic  methods  were  taken 
up, and the members of the class were shown in a crude way how to use
the  results  of  tests  to  locate  deficiencies  in  instruction  and  administra- 
tion. As  the  available  textbooks  proved  too  difficult,  a  simplified  drill 
in  educational  statistics  was  devised  to  insure  ease  and  accuracy  in 
handling  data.  At  the  end  of  the  first  semester,  a  call  was  sent  out  for 
a  new  volunteer  class  for  the  succeeding  semester,  and  as  at  first, 
the  membership  was  set  at  twenty  pupils.  We  were  gratified  to  enroll 
not  twenty  but  seventy-seven  principals  in  this  new  class  which  re- 
peated the  previous  course,  while  the  earlier  class  continued  as  an  ad- 
vanced section  taking  up  group  intelligence  tests  which  were  just  then 
coming  into  prominence.  I  should  like  to  point  out  that  we  were  ap- 
parently "selling  the  idea"  not  only  to  our  own  people  but  to  the  local 
universities  as  well.  At  the  close  of  that  year  the  demand  for  advanced 
instruction  in  educational  measurement  became  so  pronounced  that  both 
universities  offered  a  variety  of  courses  both  in  summer  session  and 
after-school  classes,  thus  relieving  us  of  any  further  responsibility  in 
the  matter.  The  universities  greatly  strengthened  our  hands  not  only 
by  providing  most  competent  instructors  but  by  offering  university  credit 
toward  a  degree  upon  these  courses. 

I now come to a phase of our campaign which is difficult to describe
as we passed thru a transition stage common to city school systems at
periodic intervals. For want of a better name I shall call it

III.  Salesmanship  Thru  Special  Rooms  and  Special  Teachers.  In 
January, 1920, I was transferred to the superintendent's office and Dr.
Sutherland  was  given  the  new  title  "director  of  the  division  of  psychol- 
ogy and  educational  research",  facing  the  monumental  task  of  doing 
single-handed  the  work  formerly  divided  between  two  men.  Work  in 
the  former  division  of  psychology  had  proceeded  briskly  since  1917  and 
three  types  of  special  rooms  were  devised  to  remedy  existing  inequalities: 
(a)  development  rooms  for  definitely  feeble-minded  children,  (b)  oppor- 
tunity rooms  for  both  border-line  cases  and  an  occasional  super-bright 
child,  (c)  adjustment  rooms.  The  adjustment  room  as  developed  at  Los 
Angeles  calls  for  more  than  passing  notice.  There  are  four  outstanding 
characteristics  of  the  adjustment  room.  (1)  Entrants  must  be  normal 
children  who  have  fallen  behind  in  their  school  work  thru  illness,  ab- 
sence, defective  teaching,  or  other  contributing  cause,  but  they  may  not 
be  subnormal  children  or  disciplinary  cases  or  physically  defective.  In 
other  words,  only  such  children  may  be  entered  in  an  adjustment  room 
as  seem  to  be  likely  material  for  educational  diagnosis  and  capable  of 
profiting  by  the  peculiar  type  of  instruction  devised  for  these  rooms.  (2) 
Pupils  remain  in  the  adjustment  room  for  a  limited  period — usually 
about sixty working days. (3) Adjustment material consisting of mimeographed
exercises, projects, tests, and practice sheets forms an informal
textbook while the official textbooks are used in a supplementary
capacity.  This  adjustment  material  is  based  on  twelve  levels  of  attain- 
ment from  low  first  to  high  sixth  grades,  each  level  representing  roughly 
a  half-year's  progress  on  the  pupil's  part.  (4)  The  unique  feature  of 
the  adjustment  room  lies  in  the  fact  that  the  pupil  pulls  the  load  while 
the  teacher  acts  in  an  advisory  capacity.  Provision  for  self-initiative 
is  made  thru  the  material  itself  and  thru  fixing  on  the  pupil  responsi- 
bility for  his  own  progress. 

During  the  transition  period  to  which  I  have  referred,  pupils  were 
admitted  to  adjustment  rooms  and  development  rooms  only  after  tests 
were  given  either  thruout  the  several  schools  or  in  part  and  after  eligible 
lists  of  pupils  had  been  prepared.  I  wish  you  to  notice  the  administra- 
tive machinery  whereby  this  testing  was  done.  The  city  was  divided 
into  several  districts,  and  over  each  of  these  was  placed  a  supervisor 
whose  specific  job  it  was  to  (a)  give  such  tests  thruout  the  several  schools 
as  would  determine  what  types  of  special  rooms  were  most  needed,  (b) 
prepare  eligible  lists  of  pupils  for  both  adjustment  and  development 
rooms,  (c)  help  new  adjustment  teachers  to  organize  their  work,  (d) 
supervise such rooms after organization until they became autonomous;
i.e.  until  the  teacher  in  charge  could  perform  her  work  so  efficiently 
that  she  needed  only  casual  and  occasional  supervision.  Please  notice 
that  in  effect  this  scheme  placed  the  research  program  in  so  far  as  tests 
were  concerned  in  charge  of  a  small  group  of  supervising  teachers  and 
removed  it  almost  completely  from  the  jurisdiction  of  the  principals  and 
regular classrooms. I need not point out that such a form of salesmanship
had one outstanding virtue and one almost fatal defect as you will
see. It had the outstanding virtue of centralizing the research program
in a small, easily-controlled body of specialists who reported directly to
their  administrative  head  so  that  a  standardized  procedure  was  easily 
worked  out.  The  fatal  defect  of  the  whole  scheme  lay  in  the  fact  that 
principal  and  regular  teachers  were  practically  ignored.  The  teacher- 
supervisor  came  to  a  school,  gave  tests,  prepared  eligible  lists,  organized 
special  rooms,  and  reported  her  findings  directly  to  her  chief.  The  special 
room  teacher  after  her  work  was  organized  also  reported  directly  to  the 
head  of  the  department,  and  the  principal's  share  in  the  whole  educa- 
tional drama  was  merely  that  of  a  bystander.  This  is  no  reflection  on 
the  efficient  and  often  remarkable  work  done  by  the  department  in  tak- 
ing care  of  the  misfit,  but  it  led  to  a  growing  unrest  among  the  prin- 
cipals who  woke  up  one  morning  to  find  the  testing  program  completely 
removed  from  their  hands.  Such  was  our  situation  up  to  the  beginning 
of  the  last  school  year.  This  brings  me  to  the  fourth  and  last  phase  of 
our  campaign  which  I  shall  call 

IV.  The  School  Counselor  Movement.  I  am  indebted  for  this  solu- 
tion to  our  difficulties  indirectly  to  my  good  friend,  Dr.  Virgil  Dickson 
of  Oakland,  and  directly  to  the  casual  visit  of  an  Oakland  teacher  who, 
in  discussing  educational  research,  said  that  in  Oakland  a  plan  had  been 
devised  whereby  a  teacher  in  each  of  several  large  elementary  schools 
had  been  partially  relieved  of  her  classroom  duties  so  as  to  permit  her 
to  give  tests  and  assist  in  grading  and  classifying  the  schools.  This  plan 
seemed  to  me  to  secure  both  those  desirable  ends  which  hitherto  had 
proved  to  be  unattainable  simultaneously:  (a)  to  place  the  testing 
program  under  such  close  supervision  as  to  make  it  most  effective,  and 
(b)  to  give  principals  and  teachers  such  a  large  share  in  the  work  as 
to  enlist  their  hearty  sympathy  and  support.  Dr.  Sutherland  proved 
most  helpful  in  planning  to  make  the  experiment  in  our  own  system,  and 
as  a  beginning  three  elementary  schools  were  selected  for  the  purpose, 
the  choice  being  finally  made  in  schools  where  at  least  one  teacher  in 
each  school  had  taken  a  rather  unusual  interest  in  experimental  educa- 
tion. Please  notice  that  we  did  not  send  a  teacher  from  the  outside  to 
each  school;  we  selected  a  teacher  in  each  case  who  had  been  in  her 
school  for  some  time,  who  knew  local  conditions  and  needs,  and  who  had 
the  confidence  and  respect  of  her  fellow-teachers  and  her  principal.  I 
should  like  to  point  out  also  that  each  of  these  three  schools  had  an  ex- 
ceptional principal,  one  who  was  interested  in  our  campaign  and  who 
could intelligently direct the work. These three "school counselors" as
we  shall  hereafter  call  them  were  given  the  following  instructions: 

"Your  job  is  to  assist  the  principal  to  regrade  and  reclassify  his 
school  to  the  end  that  children  shall  be  grouped  together  who  are  at  or 
near  the  same  educational  level  and  who  are  presumably  capable  of 
making  about  equal  progress  in  their  work.  When  your  survey  is  com- 
plete, go  over  it  with  your  principal,  and  make  such  regroupings  as  will 
prove mutually satisfactory to both teachers and principal. If a teacher's
judgment of a pupil disagrees radically with the results of your tests,
give the pupil the benefit of the doubt and stand by the teacher's judgment
until she is clearly proved to be wrong. Report your results to
Dr. Sutherland's office and obtain his advice on knotty problems which
may arise from time to time."

One  of  the  three  school  counselors  had  just  returned  from  a  year's 
work at Teachers College and desired to follow the plan of reclassifying
devised by Dr. McCall. We asked the two other counselors to follow the
same  procedure  so  as  to  unify  our  work,  and  as  a  result  the  survey 
tests  consisted  of  the  Woody  McCall  Reading,  Thorndike  McCall  Arith- 
metic, National  Intelligence  Test,  and  a  selected  group  of  words  from  the 
Ayres'  Spelling  Scale.  From  the  results  of  these  tests,  together  with  the 
teacher's  judgment,  pupils  were  reassigned  to  the  several  grades  and 
divided  into  X,  Y,  and  Z  groups  for  instructional  purposes.  As  far  as 
possible,  the  Z  groups  in  each  case  were  made  smaller  than  the  X  and  Y 
groups so as to provide for necessary variations in the course of study
and  to  admit  of  individual  child-study  and  child-diagnosis.  In  many 
cases,  the  reorganization  of  the  school  led  to  special  promotions,  a  topic 
I shall discuss later. It necessitated also demotions in several instances,
but  in  the  main  it  has  been  found  more  politic  to  place  such  pupils  in 
special  rooms  for  observation  and  study  than  to  run  the  risk  of  antagon- 
izing teachers  and  parents  at  the  inception  of  the  new  venture. 
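
The grouping step described above can be sketched in code, with the caveat that the address gives no formula for it. The sketch below assumes a single composite score per pupil, assumes X is the strongest group and Z the weakest, and uses an arbitrary share for the smaller Z group; the names Pupil, assign_groups, and z_share are all hypothetical. The one rule taken from the text is that a teacher's differing judgment prevails over the test result.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Pupil:
    name: str
    composite: float                     # hypothetical combined test score
    teacher_group: Optional[str] = None  # set only when the teacher disagrees


def assign_groups(pupils: List[Pupil], z_share: float = 0.25) -> Dict[str, str]:
    """Rank pupils by composite score and split them into X, Y, and Z.

    The lowest z_share of pupils go to Z (kept smaller than X and Y);
    the remainder is divided between X and Y. A teacher's judgment,
    when given, overrides the test-based placement.
    """
    ranked = sorted(pupils, key=lambda p: p.composite, reverse=True)
    n = len(ranked)
    n_z = int(n * z_share)
    n_x = (n - n_z + 1) // 2  # upper part of the remainder, rounded up
    groups: Dict[str, str] = {}
    for i, pupil in enumerate(ranked):
        if i < n_x:
            by_test = "X"
        elif i < n - n_z:
            by_test = "Y"
        else:
            by_test = "Z"
        groups[pupil.name] = pupil.teacher_group or by_test
    return groups


# Invented roster for illustration only.
pupils = [
    Pupil("A", 82), Pupil("B", 77), Pupil("C", 71), Pupil("D", 66),
    Pupil("E", 60), Pupil("F", 54), Pupil("G", 47, teacher_group="Y"),
    Pupil("H", 40),
]
groups = assign_groups(pupils)
```

In this invented roster pupil G would fall in the Z group on test score alone, but the teacher's placement in Y is kept, mirroring the instruction to stand by the teacher's judgment.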

After the primary reorganization had taken place, the school counselor
was asked to devote her time (a) to examining new pupils as they
entered the school and placing them properly, (b) to watching the
reorganized groups to permit of frequent readjustments as need might
arise,  (c)  to  suggesting  the  formation  of  one  or  more  special  classes — 
adjustment,  development,  or  opportunity  classes — and  to  assist  in  their 
organization, and (d) most essential, to studying at least one Z group
to find out why the pupils in that group were failing pupils. It seems
to us most essential that Z groups should be regarded not as "dumping-grounds"
for the educational misfits but rather as educational clinics
wherein  observation,  diagnosis,  and  remedial  measures  may  be  carried  on. 

Well,  these  three  pioneers  went  to  work  intelligently  and  systemat- 
ically, handling  fellow-teachers  and  principals  with  consummate  tact. 
The  teachers  were  made  to  feel  that  their  judgment  received  as  much  if 
not  more  consideration  than  the  results  of  the  tests,  and  that  if  an  ob- 
streperous parent  challenged  the  opinion  of  the  school  upon  his  child  the 
school  counselor  was  right  at  the  teacher's  side  to  give  her  moral  sup- 
port in  the  data  secured  from  the  tests.  When  a  working  classification 
into  groups  was  arrived  at,  the  results  were  turned  over  to  the  principal, 
and  he,  and  not  the  counselor,  set  the  necessary  administrative  machin- 
ery into  motion  which  resulted  in  the  desired  reorganization.  If  a  pupil 
showed  plainly  by  his  work  that  he  was  misplaced,  the  counselor  was 
the  first  person  to  suggest  a  new  placement  in  his  case.  And  let  me  not 
forget  to  add  that  in  all  this  adventure,  Dr.  Sutherland  and  his  staff 
gave us the most loyal and whole-hearted cooperation, and thru their
larger  knowledge  and  wider  experience  saved  us  from  many  bad  blunders. 

This  was  salesmanship  of  a  high  order.  Soon  other  principals  who 
had  been  watching  the  experiment  came  in  with  requests  for  school 
counselors,  and  little  by  little  the  pioneer  band  of  three  grew  slowly 
until  at  the  present  time  we  have  twenty-two  persons  engaged  in  the 
work. 



Certain  interesting  administrative  problems  arising  from  the  school 
counselor  movement  fall  to  my  lot  for  solution.  Some  of  them  were 
solved,  others  are  still  giving  us  considerable  concern.  The  more  ex- 
perienced counselors  came  to  me  soon  after  the  beginning  of  the  present 
school  year,  stating  that  their  preliminary  placements  had  been  made 
and  that  their  services  were  not  needed  at  their  several  schools  for  a 
few weeks until rechecking results would become necessary. We finally
worked  out  a  scheme  of  assignments  for  such  counselors  which  attaches 
each  to  a  home  school  where  the  counselor  must  work  at  stated  intervals 
(i.e.  at  the  beginning,  middle,  and  end  of  each  school  term)  and  which 
allows  them  for  the  balance  of  the  term  to  take  short  assignments  in 
other  schools  which  are  not  large  enough  to  support  full-time  counselors, 
but  where  superficial  surveys  are  necessary  and  helpful.  Obviously, 
results are not as complete as we would like from these short assignments,
but  we  have  learned  that  the  school  counselor  can  "sell  the  idea"  at  least 
to  a  new  school  and  awaken  a  desire  for  more  extended  work  in  the 
future. 

Another  problem  arose  over  special  promotions.  For  example,  a 
child  is  found  in  the  low  fourth  grade  whose  promotion  age  entitles  him 
to  a  place  in  the  low  fifth  grade,  an  interval  of  one  entire  year.  The 
child  is  promoted  and  fails  to  make  good,  not  that  his  mental  ability  is 
not  equal  to  the  demands  of  his  new  classification  but  that  certain 
definite  steps  in  the  curriculum  of  the  fourth  grade  have  been  omitted 
in  his  case,  and  he  works  under  a  constant  handicap  which  militates 
against  his  success.  This  problem  was  solved  by  placing  such  a  child 
in  an  adjustment  room  either  for  an  hour  a  day,  or  in  some  cases  for 
full-time  for  a  few  weeks  until  the  missing  steps  had  been  taken  and 
mastered. 

The  greatest  problem  of  all  is,  of  course,  to  adapt  the  technique  of 
instruction  in  the  several  groups  so  as  to  secure  the  maximum  results. 
Teachers  come  to  us  constantly  and  say  "What  shall  I  do  to  make  my 
teaching  of  a  Z  group  successful?"  We  have  instructed  the  counselors 
during  this  spring  term  to  make  an  intensive  study  of  one  Z  group— 
to  discover  first  of  all  why  these  pupils  are  Z  pupils.  This  involves  an 
extensive  and  intensive  case-study  of  each  individual  child,  and  we  are 
finding  out  some  rather  extraordinary  things.  In  three  of  these  classes 
we  are  trying  the  experiment  of  administering  modified  adjustment 
materials  for  two  hours  a  day,  and  in  these  groups  we  have  discovered 
that  a  number  of  pupils  who  have  been  failures  in  the  fourth  grade  have 
failed  because  they  could  work  only  at  a  high  first-grade  level  in  one 
or more subjects. How much do you suppose a fourth-grade child can
profit  by  repeating  the  work  of  the  fourth  grade  when  he  really  needs 
work  at  the  second-grade  level?  So  the  teachers  of  these  three  groups 
have  had  to  grope  painfully  down  and  down  into  the  child's  experience 
until  they  find  the  initial  causes  of  failure,  and  then  to  build  him  up 
from  that  point. 

Another  problem.  I  asked  the  principal  of  one  of  the  schools  where 
the  experiment  is  being  tried  what  he  had  gotten  out  of  his  study  of  his 
Z  group,  and  his  answer  was  that  he  had  been  amazed  to  find  that  many 
of the Z pupils were not Z pupils at all. They were, in reality, some
of them Y and X pupils who had been drilled into failure by the blundering
administrative machinery of an over-large city school system. Suggest
to a child often enough that he is a failure, and after a while the
habit of failure becomes fixed and it seems extremely difficult to remove it.

In conclusion, dear friends, let me point out that the whole school
counselor  movement  is  merely  a  means  to  an  end.  We  are  trying  to 
sell  to  our  teachers  the  idea  that  modern  education  means  an  intensive 
study  of  individual  differences  in  children  and  cannot  be  expressed  in 
terms  of  units  of  subject-matter,  pages  covered,  or  lessons  taught.  We 
are  trying  to  sell  the  idea  that  modern  education  implies  changes  in 
conduct;  in  the  formation  of  habits,  appreciations,  and  ideals;  and  only 
in  so  far  as  we  recognize  these  and  attempt  honestly  and  sincerely  to 
measure  them  in  quantitative  terms  are  we  true  to  our  profession.  This 
necessitates  salesmanship  of  a  high  order.  This  necessitates  in  a  true 
and real sense, educational research.


The Use of Educational Research and Experimentation
in School Administration


Ellis    U.    Graff,    Superintendent    of    Indianapolis    Schools 


The  problem  of  securing  continuity  of  policy  is  present  in  all  de- 
mocracies where  public  affairs  are  administered  by  lay  boards.  With 
constant  change  of  personnel  this  problem  is  intensified.  Public  school 
administration  in  the  United  States  has  undoubtedly  been  handicapped 
by  frequent  changes  in  boards  of  education  and  their  executive  officers. 
A  recent  study  appearing  in  the  First  Year-Book  of  the  Department  of 
Superintendence  shows  that  the  length  of  service  of  the  superintendent 
in  the  entire  United  States  during  the  period  1900-21  was  three  years. 
This  period  is  not  long  enough  for  the  working  out  of  thoro-going  edu- 
cational policies  and  obtaining  the  results  inherent  in  the  policies.  It 
has frequently been observed that schools which change their administrative
officers and teachers every year do not make much progress.
The  application  of  statistical  methods  to  school  administration  affords 
an  opportunity  to  obtain  a  foundation  for  school  policy  which  can  be 
built  upon  by  successive  boards  and  school  executives.  This  is  one  of 
the  most  important  contributions  of  the  statistical  method  to  school  ad- 
ministration. 

Assuming that the purpose of this discussion is the citing of concrete
illustrations of the application of statistical methods to administrative
problems rather than a discussion of the theory of statistical methods,
allow me to employ as my first example the formulation of a comprehensive
building program which was made for our city three or four
years  ago.  At  that  time  the  Board  of  School  Commissioners  realized 
that  school  accommodations  were  by  no  means  adequate  to  the  needs  and 
that  building  costs  were  unusually  high,  so  that  it  was  necessary  to 
solve  the  problem  of  additional  housing  facilities  in  terms  of  both  effi- 
ciency and  economy. 

The  first  step  in  the  solution  of  this  problem  was  the  scoring  of  the 
72  existing  buildings  by  the  use  of  the  Strayer  Score  Card  for  City 
School  Buildings.  This  card  covers  all  the  possible  facts  with  reference 
to  the  structure,  cost,  size,  and  service  systems.  It  makes  possible  the 
scoring of the buildings on a scale of 1,000 points. The result of this
survey indicated that the buildings of the city ranked all the way from
241 points out of a possible 1,000 to 803 points, and that 29 buildings
scored  less  than  500  points,  or  in  other  words,  were  less  than  one-half 
efficient. 
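
The tally just described can be sketched in a few lines. This is only an illustration: the function name `summarize_scores` and the six sample scores are hypothetical and are not the Indianapolis data; only the figures 29 and 72 and the 500-point cutoff come from the survey as reported above.

```python
from typing import Iterable


def summarize_scores(scores: Iterable[int], cutoff: int = 500) -> dict:
    """Return the lowest and highest score and the number of buildings below the cutoff."""
    scores = list(scores)
    below = [s for s in scores if s < cutoff]
    return {
        "lowest": min(scores),
        "highest": max(scores),
        "below_cutoff": len(below),
        "share_below": len(below) / len(scores),
    }


# Hypothetical scores for six buildings (NOT the actual survey data).
sample = [241, 388, 455, 512, 690, 803]
summary = summarize_scores(sample)
print(summary)

# Figures reported in the address: 29 of the 72 buildings scored under 500.
reported_share = 29 / 72
print(f"Reported share below 500 points: {reported_share:.1%}")
```

On the reported figures, roughly two buildings in five fell below the half-way mark of the scale.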

In  addition  to  the  scoring,  we  had  an  appraisal  of  all  the  properties 
made  by  a  qualified  committee,  the  result  of  which  was  the  placing  of  a 
valuation on every piece of school property. This enabled us to determine
whether a given building should be abandoned or repaired and
remodeled. We also made a study of the distribution of persons of
school age in the city in order to determine the proper location of contemplated
buildings. Tables were also made showing the rate of growth
in school population in order to determine the number of rooms necessary
to provide for the yearly increase of enrollment. The result of this
study showed that the average annual increase in the elementary department
for a period of fifteen years was slightly in excess of 1,100 pupils.

In  addition  to  the  above  facts,  we  formulated  and  adopted  a  set  of 
standardized  requirements  for  elementary  school  buildings.  As  a  basis 
for  this  we  used  the  data  furnished  by  the  Committee  on  Standards  of 
the  N.E.A.  and  supplemented  it  by  the  work  of  our  local  architects' 
association  and  a  local  committee  which  visited  and  studied  more  than 
forty  modern  elementary  schools.  The  result  of  this  study  was  the 
adoption  of  a  report  containing  standardized  requirements  which  insure 
the  obtaining  of  buildings  entirely  satisfactory  from  an  educational 
standpoint.  The  requirements  cover  such  points  as  the  percentage  of 
areas to be devoted to the various features of school work, the number
of  rooms  of  various  kinds  required,  the  shape  and  dimension  of  the 
classrooms,  and  the  general  layout  for  the  heating,  ventilation,  and 
sanitary  equipment  of  the  building.  Certain  items  of  interior  finish  are 
also  specified,  such  as  floors,  walls,  ceilings,  wainscot,  etc.  The  adoption 
of  these  requirements  does  not  necessitate  absolute  uniformity  of  result. 
The  architect  is  permitted  to  design  the  building  according  to  his  own 
ideas  so  long  as  he  includes  in  it  the  requirements  contained  in  the 
standard  specifications.  The  result  has  proved  very  satisfactory.  We 
have  obtained  buildings  which  were  uniformly  satisfactory  from  an 
educational  standpoint,  and  yet  there  has  been  a  certain  amount  of 
variety  in  design  and  finish.  A  study  of  the  pamphlet  published  by 
the Indianapolis Board of School Commissioners entitled Standardized
Requirements  for  the  Construction  of  Elementary  School  Buildings  in 
the  City  of  Indianapolis  will  make  clear  just  how  both  uniformity  and 
variety  are  obtained. 

The  four  types  of  information  above  indicated,  that  is,  building 
survey,  appraisal,  study  of  population,  and  standardized  requirements 
for  buildings,  enabled  the  Board  to  formulate  a  building  policy  covering 
a  period  of  two  years  which  called  for  new  buildings  in  various  parts 
of  the  city  with  a  total  capacity  of  172  classrooms,  and  additions  to  other 
buildings  to  the  extent  of  84  additional  classrooms.  The  report  also 
included  new  installations  of  heating  and  plumbing  with  estimated  costs 
therefor.  The  total  amount  of  the  building  program  as  then  outlined 
called  for  $1,614,398  for  the  first  year's  work  and  $1,552,855  for  the 
second  year's  work,  or  a  grand  total  of  $3,167,253.  These  estimates 
were  based  upon  the  then  current  prices  of  labor  and  materials.  The 
above  items  were,  of  course,  restricted  to  elementary  buildings.  The 
high  school  needs  were  set  forth  in  a  separate  study.  Part  of  this 
projected program was completed within the time specified, but part of
it could not be carried out owing to excessive increase in building costs
and other difficulties. Within the past six months, however, the Board
has again taken up the problem of the building program, and we have now
employed architects and are drawing plans for eight additional new
elementary buildings and several additions which will be started this
season. These new buildings are not exclusively the ones outlined by the
first building program, but the need for them was arrived at by reference
to the same facts which were used originally. The only modifications
were occasioned by new developments in the city, the increase of
school  enumeration  in  certain  sections  due  to  the  building  up  of  subur- 
ban communities,  and  similar  facts  which  could  not  be  foreseen.  It  is 
very  significant  that  the  type  of  study  indicated  furnished  facts  as  the 
basis  of  a  building  program  which  removed  the  whole  subject  from 
the  realm  of  guess  work  on  the  one  hand  or  of  petition  or  public  clamor 
on  the  other.  It  has  possessed  the  splendid  advantage  of  satisfying 
the  community  that  the  problem  was  being  solved  in  the  light  of  the 
fullest  information,  and  that  the  ultimate  outcome  of  the  whole  program 
would  be  an  adequate  and  satisfactory  building  equipment  for  the 
schools of Indianapolis.
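
The totals of the two-year program quoted above can be checked with a short calculation. The sketch below uses only the figures given in the address; the variable names are illustrative, and the average outlay per classroom is a rough derived figure, not one stated in the report. It probably overstates the cost of a classroom, since the total appears to include the heating and plumbing installations as well.

```python
# Figures quoted in the address for the two-year elementary building program.
first_year_cost = 1_614_398
second_year_cost = 1_552_855
new_classrooms = 172      # classrooms in new buildings
added_classrooms = 84     # classrooms in additions to existing buildings

total_cost = first_year_cost + second_year_cost
total_classrooms = new_classrooms + added_classrooms

# Rough average only: the total seems to cover heating and plumbing
# installations too, so this is an upper estimate per classroom.
cost_per_classroom = total_cost / total_classrooms

print(f"Total program: ${total_cost:,}")
print(f"Classrooms provided: {total_classrooms}")
print(f"Average outlay per classroom: ${cost_per_classroom:,.0f}")
```

The sum agrees with the grand total of $3,167,253 stated in the program.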

FINANCIAL   PROBLEMS 

Another  type  of  study  may  be  of  interest  for  its  bearing  on  the 
question  of  school  finance.  One  of  the  fundamental  needs  in  any  or- 
ganization is  a  knowledge  of  costs  of  maintenance  and  operation.  The 
modern  scientific  budget  and  scientific  accounting  systems  enable  us  to 
keep  as  careful  cost  finding  systems  as  can  be  found  in  any  commercial 
enterprise.  A  splendid  contribution  to  this  department  of  school  ad- 
ministration has  been  made  by  the  research  division  of  the  Intercity 
Conference  which  prepared  an  accounting  system  which  has  been  adopted 
by  the  National  Department  of  Superintendence,  the  National  Associa- 
tion of  Business  Officials,  and  the  United  States  Bureau  of  Education. 
Accompanying  the  system  is  a  blank  form  of  reporting  which  is  or- 
ganized to  correspond  with  the  plan  of  keeping  school  accounts.  This 
blank, a copy of which I have here, reclassifies and redefines the various
types  of  school  activity  so  that  there  is  no  overlapping  and  no  confusion. 
Heretofore  it  has  been  difficult  to  differentiate  between  instructional, 
administrative,  and  supervisory  activity.  This  report  attempts  to  define 
all  such  functions  so  that  the  expenditures  are  charged  to  the  correct 
items.  The  gain  in  this  plan  is  twofold:  it  enables  a  school  system 
to  know  its  own  situation  exactly  and  also  it  puts  it  upon  a  basis  which 
is  exactly  comparable  with  other  school  systems. 

As  an  example  of  a  cost  finding  study,  permit  me  to  quote  from 
a  report  recently  prepared  in  regard  to  one  of  our  own  high  schools 
covering the first semester of 1922-23. This study shows that there
were  1,984  students  enrolled  in  119  courses,  not  including  music,  physical 
training,  expression,  and  military  training.  The  average  number  of 
courses  per  pupil  was  4.3.  The  average  total  enrollment  per  class  was 
22.2.  The  computation  of  pupil  hour  cost  in  the  report  was  based  on 
the  number  of  pupils  remaining  at  the  end  of  the  semester  as  reported 
by  the  teacher.  A  pupil  hour  of  instruction  is  a  sixty-minute  period  of 
regularly  assigned  recitation  or  laboratory  time.  Since  the  class  periods 
are in multiples of forty minutes, a pupil enrolled in a single period
subject for ninety days would require sixty double hours in instruction
per  semester.  The  total  number  of  pupil  hours  of  instruction  required 
for  all  pupils  was  501,620,  or  an  average  of  65.9  per  subject,  or  295.9 
per  pupil.  For  teachers'  salaries  alone  and  considering  only  the  time 
spent  in  conducting  classes  $94,531.75  was  required.  The  percentage  of 
failures  was  12.7,  so  that  approximately  $12,000  worth  of  instruction  did 
not  bring  results  as  far  as  successful  work  was  measured  by  school 
records.  The  expenditure  for  administration  in  this  school  for  this 
semester  was  $11,103.31  or  $5.59  per  pupil.  For  supervision  the  cost 
was  $5,673.34  or  $2.86  per  pupil.  The  expenditure  for  instruction  as 
divided  among  the  various  subjects  was  as  follows: 

English                        20.2 cents per pupil hour
Commercial                     14.2 cents per pupil hour
Mathematics                    15.5 cents per pupil hour
History                        16   cents per pupil hour
Foreign Languages              19.5 cents per pupil hour
Science                        19.1 cents per pupil hour
Shop, Mechanical Drawing       24   cents per pupil hour
Art                            21.1 cents per pupil hour
Domestic Art                   18.8 cents per pupil hour

These facts show that the costs vary from 24 cents to 14.2 cents per
pupil  hour.  The  cost  per  pupil  hour  for  the  whole  school  was  18.8 
cents. A careful study of these figures enables us to adjust the teaching
load fairly to the teaching staff and to base the class enrollment
upon  a  figure  which  will  be  economical.  It  also  prevents  a  dispropor- 
tionate amount  being  expended  for  supervision  or  other  activities  out- 
side of  instruction  proper.  Of  course,  it  goes  without  saying  that  cost 
is  not  the  only  nor  indeed  the  major  factor  to  be  considered  in  educa- 
tion, but  no  doubt  all  will  agree  that  the  costs  should  be  known,  and 
decisions  as  to  the  undertaking  of  various  types  of  activity  should  be 
made  in  the  light  of  a  full  knowledge  of  the  financial  elements  involved. 
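
The arithmetic behind the cost finding study can be retraced as a minimal sketch. All figures are those quoted in the report; the variable names are illustrative. The failure cost assumes, as the report itself seems to, that salary cost is spread evenly over all pupil hours, so that 12.7 per cent of failures corresponds to 12.7 per cent of the outlay.

```python
# Figures quoted in the address for one high school, first semester of 1922-23.
enrolled_pupils = 1_984
total_pupil_hours = 501_620
salary_cost = 94_531.75          # teachers' salaries, class time only
failure_rate = 0.127
administration_cost = 11_103.31
supervision_cost = 5_673.34

# A single-period subject: one 40-minute period on each of 90 school days,
# expressed in 60-minute pupil hours.
hours_single_period_subject = 90 * 40 / 60

# Instruction cost per pupil hour for the whole school, in cents.
cents_per_pupil_hour = 100 * salary_cost / total_pupil_hours

# Assumption: cost is spread evenly, so the share of failures equals
# the share of salary outlay that did not bring results.
cost_of_failures = failure_rate * salary_cost

admin_per_pupil = administration_cost / enrolled_pupils
supervision_per_pupil = supervision_cost / enrolled_pupils

print(f"Pupil hours, single-period subject: {hours_single_period_subject:.0f}")
print(f"Cost per pupil hour: {cents_per_pupil_hour:.1f} cents")
print(f"Salary outlay on failed work: ${cost_of_failures:,.2f}")
print(f"Administration per pupil: ${admin_per_pupil:.2f}")
print(f"Supervision per pupil: ${supervision_per_pupil:.2f}")
```

The computed cost per pupil hour agrees with the 18.8 cents reported for the whole school, and the outlay on failed work comes to approximately $12,000, as stated.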

READING   STUDY 

An  example  of  the  value  of  tests  and  standards  in  the  teaching  of 
reading will be shown by the following results obtained in our schools
by  the  use  of  reading  tests.  As  shown  by  Chart  I,  the  tests  given  in 
1919  revealed  the  fact  that  our  results  in  comprehension  were  slightly 
below  the  standard  in  all  grades.  Conferences  were  held  in  an  effort 
to  determine  the  reasons  for  the  difficulty,  and  among  other  considera- 
tions was  the  question  of  the  amount  of  time  devoted  to  the  subject. 
Chart  II  was  prepared  showing  the  relation  of  the  time  spent  in  read- 
ing in  the  Indianapolis  schools  to  that  in  fifty  cities.  You  will  see  from 
this  chart  that  the  time  in  Indianapolis  is  considerably  less  than  the 
average  of  the  fifty  cities.  Thru  repeated  conferences  and  a  careful 
consideration  of  methods,  results  were  greatly  improved,  and  the  tests 
for  the  first  semester  of  this  year  show  that  our  results  were  above 
standard  in  all  grades.  This  was  obtained  without  increasing  the  time 
devoted  to  the  subject.  Chart  III  shows  a  corresponding  situation  with 
reference  to  the  speed   in   reading. 



This  study  was  largely  diagnostic.  It  is  frequently  the  case  that 
the  correction  of  a  condition  is  not  difficult  when  the  real  status  of  the 
case  is  brought  to  light.  Of  course,  if  special  adjustments  or  methods 
are  needed,  they  can  be  provided  as  soon  as  the  exact  needs  are  known. 


CHART I
Showing the achievements of the Indianapolis Public Schools in comprehension in
April, 1920, and in January, 1923, compared with an estimated standard. The tests
used were those devised by Monroe to test speed and comprehension in silent reading.




CHART III
Showing achievement of Indianapolis Public Schools in rate of silent reading in
April, 1920, and January, 1923, compared with a standard. The tests used were those
devised by Monroe to test rate and comprehension in silent reading.



The use of statistical methods in administrative problems is so frequent
that it would be impossible in a paper of this sort to cover very
much  of  the  field.  Very  few  problems  in  administration  are  solved  with- 
out the  use  of  statistical  methods,  and  the  department  of  research  is  one 
of the busiest places in the modern school office. Of the rightness of an
opinion  of  Dr.  H.  O.  Rugg,  I  think  there  can  be  no  doubt.  In  the  pref- 
ace of  his  book,  Statistical  Methods  Applied  to  Education,  he  says, 
"This  book  is  based  upon  the  doctrine  that  statistical  methods  in  them- 
selves prove  nothing, — that  the  methods  selected  for  use  in  a  particular 
situation  must  agree  with  the  logic  of  that  situation:  in  a  word,  that 
statistical  methods  are  merely  quantitative  devices  which  we  can  use  to 
refine  our  thinking  about  complex  masses  of  data  and  to  refine  our 
methods  of  expression." 


The  Use  of  Educational  Research  and  Experi- 
mentation  in   School  Administration 


A. C. Senour, Director of Department of Measurements, East Chicago
(Ind.) Schools


School  service  must  commence  and  terminate  with  the  teacher. 
What  the  classroom  director  does  for  those  under  her  charge  ultimately 
is  the  measure  of  the  worth  of  the  school  enterprise.  That  is  not  ques- 
tioned. Any  discussion  of  the  use  in  school  administration  of  any  pro- 
gram or  movement  that  does  not  consider  its  value  to  or  influence  upon 
the  teacher  necessarily  fails  of  completeness.  Perhaps  it  misses  the 
mark  entirely. 

In  discussing  the  use  of  educational  research  and  experimentation 
in  school  administration  I  choose,  therefore,  as  of  major  importance  the 
value  to  the  teacher.  Experimentation  is  stimulating.  It  calls  for  crit- 
ical analysis  of  past  performances,  a  weighing  of  procedures,  an  alert- 
ness as  to  steps  to  be  taken,  an  anticipatory  state  of  mind  concerning 
outcomes,  and  concludes  with  a  feeling  of  happiness  from  having  at- 
tempted the  development  or  creation  of  something.  Experimentation 
is  an  antidote  for  stagnation,  a  preventive  of  excessive  routinizing. 

Last  fall  in  East  Chicago  an  experiment  was  undertaken  to  deter- 
mine the  relative  value  of  two  different  methods  of  language  presenta- 
tion. A  teacher  who  was  ordinarily  successful  but  was  not  especially 
enthusiastic  in  teaching  language  was  selected  to  conduct  the  experi- 
ment. The  chief  result  of  her  work  may  be  stated  in  her  words:  "I 
formerly  felt  the  language  period  a  burden.  I  look  forward  to  it  now 
with  pleasure."  This  was  volunteered  one  day  in  discussing  the  experi- 
ment with  her.  Of  what  inestimable  worth  to  a  class  is  this  enthusiasm 
of  the  teacher! 

The  experiment  which  the  University  with  the  cooperation  of  a 
number  of  the  teachers  thruout  the  state  has  recently  concluded  was 
similarly  productive  in  East  Chicago.  Our  reading  in  the  intermediate 
grades  is  taught  departmentally  by  an  especially  capable  and  earnest 
group  of  teachers.  But  even  they,  and  thru  them  the  children,  have 
profited  as  a  result  of  the  increased  vigor  and  stimulation  to  more 
searching  inquiry  which  the  experiment  generated. 

Another  important  use  of  research  to  the  administrator  is  the  find- 
ings and  their  applications.  Educational  advance  is  possible  thru  theo- 
rizing, but  that  advance  is  more  sure,  with  less  chance  for  necessity 
for  retreat,  if  the  theorizing  is  substantiated  by  actual  experimentation. 
The  public  schools  must  rely  extensively  upon  the  universities  as  re- 
search centers  for  progressive  ideas  of  organization  and  instruction. 
These  institutions  have  a  genuine  obligation  which  they  are  appreciating 
more and more. Very many of the recent scientific determinations of
methods of teaching the various subjects are their products. Estimate
the  value  to  the  administrator  of  the  clean-cut  statements  for  him  of 
the principles of method in instruction in spelling, or in reading, that
have  been  determined  largely  by  university  research.  How  much  more 
definite  the  course  of  study  constructor  can  be! 

The  researches  by  the  educational  psychologists  in  the  field  of  in- 
telligence measurement  have  provided  the  administrator  with  instru- 
ments which  are  gradually  culminating  in  altered  organizations.  Ad- 
ministrators have  recognized  for  years  the  wide  range  in  ability  of  the 
children  grouped  into  a  class  for  educational  purposes.  Nevertheless, 
but little was done until very recently to ameliorate the situation. The
administrator's  use  of  the  research  product,  the  intelligence  tests,  to 
reclassify   pupils   into  more  homogeneous  groups  is  a  beginning. 

At  East  Chicago  classifications  into  ability  groups  on  the  basis  of 
intelligence  testing,  supplemented  by  achievement  test  scores,  teachers' 
rankings, etc., have been begun. All our primer pupils and those entering
our  junior  high  school  were  classified  last  September  and  again  in  Feb- 
ruary. While  we  have  no  objectively  obtained  data  as  to  the  benefits 
in  the  first  grade,  we  do  have  the  teachers'  recommendations  for  the 
continuance  of  the  work  on  the  strength  of  the  increased  service  they 
have been enabled to render to those under their charge. In the seventh
grade in arithmetic fundamentals, all sections, the strongest to the
weakest  in  ability,  made  more  than  the  normal  gain,  the  strongest  ex- 
ceeding the  normal  gain  by  nearly  100  per  cent;  the  weakest,  by  ap- 
proximately H">  per  cent.  East  Chicago  contemplates  the  continuation 
of  this   multiple-track  organization   until   all   grades  are  so   established. 

Classifications, however, produce a new problem: that of curricular
adjustments. It is at this stage that the administrator must again employ
research and experiment. What differentiations in methods and
materials  must  be  made?  The  answer  can  be  given  only  from  the  find- 
ings of  experimentation.  We,  at  East  Chicago,  are  planning  an  attack 
upon  the  problem.  I  conceive  the  situation,  tho,  as  one  which  might 
well  be  undertaken  on  a  more  extended  scale.  Is  the  problem  not  one 
of  such  coming  universal  importance  that  the  more  progressive  schools 
of the state can cooperatively consider it under the leadership of the
University  Bureau  of  Cooperative  Research  or  of  the  State  Department 
of  Education?  Let  us  ponder  over  this  suggestion,  not  merely  assent 
or  dissent. 

In  conclusion,  I  desire  to  mention  one  of  the  difficulties  encountered 
in  the  use  of  research,  one  that  will  never  entirely  be  overcome  owing  to 
human  nature.  It  is  the  false  impression  which  too  frequently  has  been 
implanted,  perhaps  not  intentionally,  by  teacher  training  institutions 
that  methods  and  devices  which  are  discussed  and  adopted  by  teachers 
in training are the desirable ones and that others must not be considered.
The  hesitancy  on  the  part  of  teachers  to  accept  different  standards  of 
work  from  sections  or  individuals  of  differing  abilities  is  an  illustration. 
Our  training  institutions  must  strive  to  overcome  this  attitude  in  the 
teachers  now  in  training. 


